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(57) Abstract 



The present invention relates to a method for producing a nucleotide sequence construct with optimized codons for an HIV genetic 
vaccine based on a primary, early HIV isolate. Specific such nucleotide sequence construct are the synthetic envelope BX08 constructs 
The invention further relates to the medical use of such constructs for the treatment and prophylaxis of HIV through DNA vaccine and for 
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Method for producing a nucleotide sequence construct with optimised codons for an HIV 
genetic vaccine based on a primary, early HIV isolate and synthetic envelope BX08 
constructs. 

5 Field of the invention 

The invention relates to a DNA vaccine against HIV, which is designed from a clinical 
primary isolate. One aspect of the invention relates to a method of producing a nucleotide 
sequence construct, in a prefered aspect based on a cassette system, the nucleotide 
sequence construct being used as a DNA vaccine. The method can, for example, lead to the 
10 disclosed synthetic BX08 HIV-1 envelope vaccine nucleotide sequence construct, designed 
to generate suitable DNA vaccines against HIV, specifically HIV-1. Furthermore, the 
invention can be used for the production of recombinant protein antigens. 

Background of the invention 

15 There is an urgent need for new vaccine strategies against HIV. One such new promising 
strategy is called genetic immunisation or DNA vaccine (Webster et al 1997). Some of the 
advantages of a DNA vaccine against HIV is the induction of Th cell activation, induction of 
antibodies also against conformational dependent epitopes, and the induction of cellular 
immunity. So far, most DNA vaccine envelope genes tried, have been from tissue culture 

20 adapted virus strains (Boyer et al 1997) that often differs in several aspects from primary 
clinical isolates (such as early isolates) e.g. in co-receptor usage (Choe et al 1996, Dragic et 
al 1997). 

One disadvantage in HIV envelope based DNA vaccines may be the intrinsic relatively low 
25 expression which is regulated by the Rev expression. This may prevent an optimal 
investigation of the vaccines in small animal models like mice where Rev is functioning 
suboptimally. Recently it has been shown using the tissue culture adapted HIV-1 MN strain, 
that an exchange of the HIV codon usage to that of highly expressed mammalian genes 
greatly improves the expression in mammalian cell lines and renders the HIV expression Rev 
30 independent (Haas et al 1996). Additionally, it is known that rare codons cause pausing of 
the ribosome, which leads to a failure in completing the nascent polypeptide chain and a 
uncoupling of transcription and translation. Pausing of the ribosome is thought to lead to 
exposure of the 3' end of the mRNA to cellular ribonucieases. 
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The world-wide spread of HIV-1 has presently resulted in 8,500 new infections daily and 
AIDS is now the number 1 cause of death among US males (and number 3 among US 
females) aged 25-40 years. The epidemic hot-spots now include Eastern Europe, India and 
5 South East Asia and southern Africa. The attempts to solve this world-wide problem involve 
education, prevention, treatment and vaccine development. Affordable protective vaccines 
represent the best solution to the world-wide problem of infection with HIV-1. Induction of 
virus neutralising antibodies is one of the key components in vaccine development. Several 
recombinant envelope vaccines have been tested in humans and animals, however, they 

10 seem unable to induce sufficient protection. In this respect DNA vaccination may provide a 
different and more natural mode of antigen presentation. It is hoped that the immune 
responses induced by such DNA vaccines could aid in limiting virus replication, slowing 
disease progression or preventing occurrence of disease. Unfortunately many HIV envelope 
vaccines induce only moderate levels of antibodies. This could in part be due to limitations in 

15 expression, influenced by regulation by the Rev protein and by a species-specific and biased 
HIV codon usage. Also the virus variability is considered a barrier for development of 
antibody based vaccines and thus a tool for more easy producing of closely related vaccine 
variants is needed. 

20 It has been suggested to improve the immunogenicity and antigenicity of epitopes by certain 
mutations in the envelope gene. An elimination of certain immune dominant epitopes (like 
V3) could render less immune dominant but more relevant, conserved, or hidden epitopes 
more immunogenic (Bryder et al 1999). Also elimination of certain N-linked glycosylation 
sites could improve the exposure of relevant epitopes and increase the immunogenicity of 

25 those epitopes. Thus, it is possible that elimination of the glycosylation sites in V1 and V2 
may in a more favourable way expose neutralising epitopes (Kwong et al 1998, Wyatt et al 
1998). The HIV envelope contains putative internalisation sequences in the intracellular part 
of gp41 (Sauter et al 1996). Thus it would be relevant to eliminate and/or mutate the 
internalisation signals in a membrane bound HIV envelope vaccine gene to increase the 

30 amount of surface exposed vaccine derived HIV glycoproteins as gp150. Since the antibody 
response, that is measured and calculated in titers, is improved by adding the secreted 
gp120 as opposed to adding the membrane bound form (Vinner et al 1999), it could be 
advantageous to express the vaccine as a secreted gp120 or a secreted gp140. This would 
include important parts of gp41 , such as the 2F5 neutralising linear epitope (Mascola et al 

35 1997). 
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Summary of the invention 

Our suggested solution to the problems described above is to design DNA envelope 
vaccines from a clinical primary isolate with Rev-independent high expression in mammals, 
that is built as a cassette for easy variant vaccine production. 

5 

A method of producing a nucleotide sequence construct with codons from highly expressed 
mammalian proteins based on a cassette system coding for an early, primary HIV envelope 
is described. The method comprises the steps of direct cloning of an HIV gene, derived from 
a patient within the first 12 months of infection, thereby obtaining a first nucleotide sequence; 

10 designing a second nucleotide sequence utilising the most frequent codons from mammalian 
highly expressed proteins to encode the same amino acid sequence as the first nucleotide 
sequence; redesigning the second nucleotide sequence so that restriction enzyme sites 
surround the regions of the nucleotide sequence encoding functional regions of the amino 
acid sequence and so that selected restriction enzyme sites are removed, thereby obtaining 

15 a third nucleotide sequence encoding the same amino acid sequence as the first and the 
second nucleotide sequence; redesigning the third nucleotide sequence so that the terminals 
contain convenient restriction enzyme sites for cloning into an expression vehicle; producing 
snuts between restriction enzyme sites as well as terminal snuts and introducing snuts into 
an expression vehicle by ligation. The nucleotide sequence construct obtained by this 

20 method uses the mammalian highly expressed codons (figure 1) and renders the envelope 
gene expression Rev independent and allows easy cassette exchange of regions 
surrounded by restriction enzyme sites that are important for immunogenicity, function, and 
expression. 

25 The method can, for example, lead to the disclosed synthetic, Rev-independent, clinical 
(such as early), primary HIV-1 envelope vaccine gene, built as a multi cassette. From the 
sequence of the envelope of the HIV-1 BX08 isolate (personal communication from Marc 
Girard, Institute Pasteur, Paris), the present inventors have designed a synthetic BX08 HIV-1 
envelope vaccine nucleotide sequence construct. 

30 

With the great diversity of envelopes in HIV among different patients and within one patient, 
it would be of advantage to vaccinate with several envelope variants, all being highly 
expressed. To avoid synthesising several full length envelopes, it is much easier to 
exchange relevant parts of an envelope cassette to various strains in a multivalent vaccine. 
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4 

Whether it is the disclosed synthetic BX08 nucleotide sequence construct, or any of the 
nucleotide sequence constructs obtained by the method, they are designed to generate 
suitable DNA vaccines against HIV, specifically HIV-1. In this case the mammal, preferably a 
human being, is inoculated with the nucleotide sequence construct in an expression vehicle 
5 and constitutes a host for the transcription and translation of the nucleotide sequence 
construct. The nucleotide sequence constructs of the present invention can furthermore be 
used for the production of recombinant protein antigens. In this case the nucleotide 
sequence construct is placed in an expression vehicle and introduced into a system (e.g. a 
cell-line), allowing production of a recombinant protein with the same amino acid sequence. 
10 The recombinant protein is then isolated and administered to the mammal, preferably a 
human being. The immune system of the mammal will then direct antibodies against 
epitopes on the recombinant protein. The mammal, preferably a human being, can thus be 
primed or boosted with DNA and/or recombinant protein obtained by the method of the 
invention. 

15 

A relevant HIV DNA vaccine can potentially be used not only as a prophylactic vaccine, but 
also as a therapeutic vaccine in HIV infected patients, e.g. during antiviral therapy. An HIV 
specific DNA vaccine will have the possibility to induce or re-induce the wanted specific 
immunity and help the antiviral therapy in limiting or even eliminating the HIV infection. The 

20 immunogenicity and antigenicity of epitopes in the envelope can be improved by certain 
mutations in the envelope gene. The cassette system allows for easy access to the relevant 
parts of the envelope gene, and thereby eased efforts in the process of genetic manipulation. 
Some suggested mutations are: an elimination of certain immune dominant epitopes (like 
V3); elimination of certain N-linked glycosylation sites (like glycosylation sites around V2); 

25 elimination and/or mutation of the nucleotide sequence encoding the internalisation signals in 
the cytoplasmic part of a membrane bound HIV envelope to increase the amount of surface 
exposed vaccine derived HIV glycoproteins; elimination or mutation of the cleavage site 
between gp120 and gp41; with introduced mutations in gp41 for preserving conformational 
epitopes. 

30 

Table 1 below, lists the nucleotide sequence constructs of the invention by the names used 
herein, as well as by reference to relevant SEQ ID NOs of DNA sequences, and the amino 
acid sequence encoded by the DNA sequence in the preferred reading frame. It should be 
noted, that the snut name consist of the number of the approximate position for the end of 
35 the snut and the restriction enzyme used to cleave and/or ligate that end of the snut. 
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Table 1 List of names of nucleotide sequence constructs (Snuts (S) and Pieces (P)) 
with reference to SEQ ID NO for the nucleotide sequence and protein sequence. 



Name 


Nucleotide SEQ ID 


Protein SEQ ID 




NO: 


NO: 


So-N-Lang 


1 


2 


S235EC0RV 


3 


4 


S375Pstl 


5 


6 


S495Clal 


7 


8 


S650-720ECORI 


9 


10 


SgooXbal 


11 


12 


S990Sacl 


13 


14 


Smospel 


15 


16 


Si265Xhol 


17 


18 


Si265gp120 


19 


20 


Sl265gp160 


21 


22 


Si465Pstl 


23 


24 


Si465Pstl cys 


25 


26 


Si630Xbal 


27 


28 


Si700Eagl 


29 


30 


Si890Hlndlll 


31 


32 


S2060Sacll 


33 


34 


S2190Clal 


35 


36 


S2330Pstl 


37 


38 


S2425ES 


39 


40 


Pi 


41 


42 


P 2 


43 


44 


P 3 


45 


46 


P3GV1 


47 


48 


P3 GV1V2 


49 


50 


P3GV2 


51 


52 


P4gp160 


53 


54 


P4gp150 


55 


56 


P4gp140 


57 


58 


P 5 


59 


60 


P8gp160 


61 


62 


P8gp150 


63 


64 


P8gp140 


65 


66 


synBX08-140 


67 


68 


synBX08-150 


69 


70 


synBX08-160 


71 


72 


synBX08-120 


73 


74 


synBX08-41 


75 


76 
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Detailed disclosure of the invention 

One aspect of the present invention relates to a method for producing a nucleotide sequence 
construct coding for an HIV gene. The nucleotide sequence construct is produced as a 
cassette system consisting of snuts. A snut (S) is a nucleotide sequences construct between 
5 restriction enzyme cleavage sites comprising the minimal entity of the cassette system. 

First an HIV gene is obtained from a patient within the first 12 months of infection. The term 
HIV should be understood in the broadest sense and include HIV 1 and HIV 2. It is possible 
to determine the period in which the infection has taken place with an accuracy depending 

10 on the frequency of the blood tests taken from the patient. For example, patients suffering 
from various diseases such as lack of certain factors in their blood or hepatitis have their 
blood tested on a regular basis making it possible to determine the period in which the 
infection has taken place. Apart from patients with diseases wherein blood tests are used to 
monitor the course of the disease, other groups of patients have blood tests taken, e.g. blood 

15 donors. Unfortunately, humans are still infected due to transfer of virus in blood samples, 
medical equipment, etc., making it possible to determine the date where the infection has 
taken place within the time frame of a few days. The importance of obtaining the virus early 
in the course of the infection is due to the known fact that many early isolates share the 
common feature of staying relatively constant in their envelope sequences (Karlsson et al., 

20 1998). As these early isolates may share cross-reactive antibody- and/or T-cell epitopes a 
vaccine based on such early isolates would have a better chance of inducing immune 
response to shared epitopes of the virus. It is believed that an early, directly cloned virus 
isolate will share immunogenic sites with other early virus isolates seen during an HIV 
infection, so that if a mammal generates antibodies and/or T-cells directed against these 

25 epitopes, the transferred virus will be eliminated prior to the extensive mutations that may 
occur after approximately 12 months of infection. Thus, the virus should be isolated as early 
as possible, that is within the first 12 months of infection, such as 11, 10, 9, 8, 7, 6, 5, 4, 3, 2, 
1, or 0.5 month after infection. 

30 The HIV gene for genetic vaccine is preferably cloned directly from viral RNA or from proviral 
DNA. Direct cloning in this application stands for the virus not being multiplied in stable cell 
lines in vitro. It is presently expected that passing the virus through a stable cell line will 
promote mutation in the virus gene. It is particularly preferred not to pass the virus through 
cells lines selecting for viruses with CXCR4 receptor usage. Direct cloning also includes 

35 multiplication of virus in e.g. PBMC (peripheral blood mononuclear cells) since all virus can 
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multiply in PBMC, and this type of multiplication generally does not select for CXCR4 
receptor usage. Multiplication of virus is often necessary prior to cloning. Preferably cloning 
is performed directly on samples from the patient. In one embodiment of the invention, 
cloning is performed from patient serum. The cloning is then performed directly on the HIV 
5 virus, that is on RNA. In another embodiment of the invention cloning, is performed from 
infected cells. The cloning is then performed on HIV virus incorporated in the genes in an 
infected cell (e.g. a lymphocyte), that is on DNA. In the latter case the virus might be a silent 
virus, that is a non-replicating virus. To evaluate if the virus is silent, capability of 
multiplication in e.g. PBMC is tested. 

10 

Cloning is a technique well known to a person skilled in the art. A first nucleotide sequence is 
hereby obtained. In another aspect of the invention, the first nucleotide sequence, sharing 
the properties mentioned with direct cloning, is obtained by other means. This could be from 
a database of primary isolates or the like. 

15 

Based on the first nucleotide sequence, the amino acid sequence encoded by said 
nucleotide sequence is determined. A second nucleotide sequence encoding the same 
amino acid sequence is then designed utilising the most frequent codons from highly 
expressed proteins in mammalians (e.g. figure 1 presenting the most frequent codons from 
20 highly expressed proteins in humans). 

Presently, it appears that the usage of the most frequent codons from mammalian highly 
expressed proteins has two advantages: 1) the expression is Rev independent; 2) the level 
of expression is high. The Rev independence is especially advantageous when performing 

25 experiments in mice where the Rev systems is functioning sub-optimally. For the use in 
human vaccine, Rev independence and high expression are important to increase the 
amount of antigen produced. The determination of the codons for high expression is in this 
context based on the statistics from human highly expressed proteins (Haas, Park and Seed, 
1996 hereby incorporated by reference). It is contemplated that the expression of a protein 

30 can be even higher, when current research in binding between codon (on the mRNA) and 
anticodon (on the tRNA) reveals codons with optimal binding capabilities, and when 
interactions in-between codons and/or in-between anticodons are known. 

The second nucleotide sequence designed utilising optimised codons is then redesigned to 
35 obtain a third nucleotide sequence. The purpose of the redesigning is to create unique 
restriction enzyme sites around the nucleotide sequence encoding functional regions of the 
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amino acid sequence. By having unique restriction enzyme sites around the nucleotide 
sequence encoding functional regions of the amino acid sequence, the nucleotide sequence 
encoding functional regions of the amino acid sequence can easily be isolated, changed, and 
re-inserted. Examples of functional regions of the amino acid sequence are transmembrane 
5 spanning regions, immunodominant regions, regions with antibody cross reacting domains, 
fusion domains and other regions important for immunogenicity and expression such as 
variable region 1 (V1), variable region 2 (V2), variable region 3 (V3), variable region 4 (V4) 
and variable region 5 (V5). 

10 It is important to select the restriction enzymes sites with care. By changing the second 
nucleotide sequence to insert restriction enzyme sites around the nucleotide sequence 
encoding functional regions of the amino acid sequence, the third nucleotide sequence must 
still code for the same amino acid sequence as the second and first nucleotide sequence do. 
Thus, if necessary, the second nucleotide sequence is redesigned by changing from 

15 optimised codons to less optimal codons. It is understood, that the restriction enzyme sites 
around the nucleotide sequence encoding functional regions of the amino acid sequence 
should preferably be placed in the terminal region of the nucleotide sequence encoding 
functional regions of the amino acid sequence. That is preferably outside the nucleotide 
sequence encoding functional regions of the amino acid sequence, such as 90 nucleotides 

20 away, e.g. 81, 72, 63, 54, 45, 36, 27, 21, 18, 15, 12, 9, 6,3 nucleotides away, but could also 
be inside the nucleotide sequence encoding functional regions of the amino acid sequence, 
such as 54, 45, 36, 27, 21, 18, 15, 12, 9, 6, 3 nucleotides inside the nucleotide sequence 
encoding the functional region of the amino acid sequence. 

25 The type of restriction enzyme sites allowed is determined by the choice of expression 

vector. In certain cases, the number of restriction enzyme sites is limited and it is hard, if not 
impossible, to place unique restriction enzyme sites around all the nucleotide sequences 
coding for functional regions of the amino acid sequence. This problem can be solved by 
dividing the entire nucleotide sequence into pieces, so that each piece comprises only 

30 unique restriction enzyme sites. Modifications to each of the piece is performed separately 
prior to assembly of the pieces. It is preferred that the nucleotide sequence is divided into 9 
pieces. In another aspect, the nuclotide sequence is divided into 8 pieces, or 7, or 6, or 5, or 
4, or 3, or 2 pieces. It is especially preferred that the nucleotide sequence is divided into 3 
pieces. 



35 
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Thus, the redesign of the second nucleotide sequence is an interaction between the choice 
of cloning vector, expression vector, selection of restriction enzyme sites, dividing into 
pieces, and exchange of codons to insert restriction enzyme sites. In a preferred 
embodiment of the present invention the cloning vector is Bluescript allowing the restriction 
5 enzyme sites chosen from the group consisting of: Eagl, Mlu\, EcoRV, Pst\, C/al, EcoRI, 
Xba\ t Sad Spel, Xftol, HindWl Sacll, A/ofl, BamHI, Sma\, Sa/I, Oral, Kpnl. If other cloning 
vectors are chosen, other restriction enzyme sites will be available as known by the person 
skilled in the art. 

10 As a part of the redesigning of the second nucleotide sequence, selected restriction enzyme 
sites may be removed. The selected restriction enzyme sites to be removed are those sites 
that are sites of the same type as the ones already chosen above and that are placed within 
the same piece. The removal of these restriction enzyme sites is performed by changing 
from optimised codons to less optimal codons, maintaining codons for the same amino acid 

15 sequence. 

The third nucleotide sequence is redesigned so that the terminal snuts contain convenient 
restriction enzyme sites for cloning into an expression vehicle. The expression "vehicle" 
means any nucleotide molecule e.g. a DNA molecule, derived e.g. from a plasmid, 

20 bacteriophage, or mammalian or insect virus, into which fragments of nucleic acid may be 
inserted or cloned. An expression vehicle will contain one or more unique restriction enzyme 
sites and may be capable of autonomous replication in a defined host or vehicle organism 
such that the cloned sequence is produced. The expression vehicle is an autonomous 
element capable of directing the synthesis of a protein. Examples of expression vehicles are 

25 mammalian plasmids and viruses, tag containing vectors and viral vectors such as 

adenovirus, vaccinia ankara, adenoassociated virus, cannarypox virus, simliki forest virus 
(sfv), Modified Vaccinia Virus Ankara (MVA), and simbis virus. In one embodiment of the 
invention, the expression vector contains tag sequences. In another embodiment of the 
invention a bacteria is transformed with an expression plasmid vector and the bacteria is 

30 then delivered to the patient. Preferred expression vehicles are simliki forest virus (sfv), 
adenovirus and Modified Vaccinia Virus Ankara (MVA). 

The snuts are produced by techniques well known by the person skilled in the art. The 
preferred method for synthesising snuts, is herein referred to as "the minigene approach" 
35 wherein complementary nucleotide strands are synthesised with specific overhanging 

sequences for annealing and subsequent ligation into a vector. This can be performed with 
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two sets of complementary nucleotide strands, or with three sets of complementary 
nucleotide strands. The minigene approach minimises the known PCR errors of mismatches 
and/or deletions, which may occur due to hairpins in a GC rich gene with mammalian highly 
expressed codons. In figures 10-21 , the production of a representative selection of snuts is 
5 illustrated. 

For the production of long snuts, that is snuts of more than about 240 nucleotides, the 
technique of overlapping PCR is preferred as illustrated in figure 8. Herein two nucleotide 
strands about 130 nucleotides long with an overlap are filled to obtain a double strand, which 
10 is subsequently amplified by PCR. 

For the production of multiple snuts with a length of less than about 210 nucleotides, one 
preferred technique is normal PCR. In a preferred production technique the snuts are 
synthesised with the same 5* flanking sequences and with the same 3' flanking sequences, 
15 as illustrated in figure 9. The advantages of this approach is, that the same PCR primer set 
can be used for amplification of several different snuts. 

As known by the person skilled in the art, special conditions have to be used for each 
individual PCR reaction and it should be optimised to avoid inherent problems like deletions 
20 mismatches when amplificating GC rich genes from synthetic ssDNA material. Whichever of 
the above mentioned techniques are used, it is well known by the person skilled in the art, 
that it will be necessary to correct unavoidable mismatches produced either due to the 
nucleotide strand synthesis material and/or the PCR reaction. This can be performed by site 
directed mutagenesis techniques. 

25 

After the various snuts have been produced, they are assembled into pieces and 
subsequently into the complete gene. Methods for assembly (such as ligation) are well 
known by the person skilled in the art. 

30 

In a preferred embodiment of the present invention the HIV gene encodes the entire HIV 
envelope. It is understood that the HIV envelope can be the full length envelope gp160 as 
well as shorter versions such as gp150, gp140, and gp120 with or without parts of gp41. 

35 As will be known by the person skilled in the art, the HIV is divided into several groups. 
These groups presently include group M, group O, and group N. Further, the HIV is divided 
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into subtypes A, B, C, D, E, F, G, H, I, and J. In the present invention subtype B is preferred 
due to the high prevalence of this subtype in the Western countries. 

The determination of groups and subtypes is based on the degree of nucleotide sequence 
5 identity in the envelope gene is presently defined as follows: If the sequence identity is more 
than 90% the viruses belong to the same subtype; If the sequence identity is between 80% 
and 90% the viruses belong to the same group. If the sequence identity is less than 80% the 
viruses are considered as belonging to different groups. 

10 One aspect of the invention relates to a nucleotide sequence construct in isolated form which 
has a nucleotide sequence with the general formula (I), (II), (III), or (IV) 

(I) Pl-S 4 95ClarS 6 50-720EcoRrP2-Si265gp120 

(II) Pl-S495ClarS650-720EcoRrP2*"Si265Xhor S<|465Pstr P4gp140 

(III) Pl-S 4 95ClarS650-720EcoRrP2-Sl265Xhor S<| 46 5Pstr P4gp150 

15 (IV) Pl-S495ClarS650-720EcoRrP2~Si265Xhor Si465Pstl" P4gp16(T S2060Sacir P5 

wherein Pi designates the nucleotide sequence SEQ ID NO:41, a nucleotide sequence 
complementary thereto, or a nucleotide sequence with a sequence identity of at least 90% 
thereto; 

wherein S 49 5ciai designates the nucleotide sequence SEQ ID NO: 7, a nucleotide sequence 
20 complementary thereto, or a nucleotide sequence with a sequence identity of at least 95% 
thereto; 

wherein S 65 o-72oecori designates the nucleotide sequence SEQ ID NO: 9, a nucleotide 
sequence complementary thereto, or a nucleotide sequence with a sequence identity of at 
least 95% thereto; 

25 wherein P 2 designates the nucleotide sequence SEQ ID NO: 43, a nucleotide sequence 
complementary thereto, or a nucleotide sequence with a sequence identity of at least 85% 
thereto; 

wherein S 12 65g P i2o designates the nucleotide sequence SEQ ID NO: 19, a nucleotide 
sequence complementary thereto, or a nucleotide sequence with a sequence identity of at 
30 least 70% thereto; 

wherein S 12 65Xhoi designates the nucleotide sequence SEQ ID NO: 17, a nucleotide sequence 
complementary thereto, or a nucleotide sequence with a sequence identity of at least 80% 
thereto; 

wherein S 1465 p s ti designates the nucleotide sequence SEQ ID NO: 23, a nucleotide sequence 
35 complementary thereto, or a nucleotide sequence with a sequence identity of at least 90% 
thereto; 
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wherein P 4gp i4o designates the nucleotide sequence SEQ ID NO: 57, a nucleotide sequence 
complementary thereto, or a nucleotide sequence with a sequence identity of at least 85% 
thereto; 

wherein P 4gp i5o designates the nucleotide sequence SEQ ID NO: 55, a nucleotide sequence 
5 complementary thereto, or a nucleotide sequence with a sequence identity of at least 85% 
thereto; 

wherein P 4gp i6o designates the nucleotide sequence SEQ ID NO: 53, a nucleotide sequence 
complementary thereto, or a nucleotide sequence with a sequence identity of at least 85% 
thereto; 

10 wherein S 2 o6osacii designates the nucleotide sequence SEQ ID NO: 33, a nucleotide 

sequence complementary thereto, or a nucleotide sequence with a sequence identity of at 
least 98% thereto; and 

wherein P 5 designates the nucleotide sequence SEQ ID NO: 59, a nucleotide sequence 
complementary thereto, or a nucleotide sequence with a sequence identity of at least 85% 
15 thereto. 

The design of the parent synthetic BX08 gp160 envelope cassette gene with its variant 
length genes gp150, gp140, gp120 is outlined in figure 2. 

20 The nucleotide sequence construct with the formula (I) 

(I) Pl-S495Cla!-S 6 50-720EcoRrP2-Si265gp120 

(visualised in figure 3) (SEQ ID NO: 73) codes for the amino acid sequence of gp120 (SEQ 
ID NO: 74). This amino acid sequence is the part of the HIV envelope that is secreted. Thus, 
it contains the immunogenic epitopes without being bound to the cell membrane. This is of 
25 particular advantage if the nucleotide sequence construct is used for production of 

recombinant antigens or for a DNA vaccine as the antibody immune response may be higher 
to secreted versus membrane bound HIV antigens. 

The nucleotide sequence construct with the formula (II) 

30 (II) Pl-S 4 95ClarS 6 50-720EcoRrP2-Si265Xhor S 146 5Pstr P4gp140 

(visualised in figure 4) (SEQ ID NO: 67) codes for the amino acid sequence of gp140 (SEQ 
ID NO: 68). This amino acid sequence encodes the gp120 and the extracellular part of the 
gp41 protein. The amino acid sequence is secreted due to the lack of the transmembrane 
spanning region. This is of particular advantage if the nucleotide sequence construct is used 
35 for production of recombinant antigens as the immunogenic and/or antigenic epitopes in the 
extracellular part of gp41 are included and is of particular advantage for a DNA vaccine as 
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the antibody immune response may be higher to secreted gp120 versus membrane bound 
HIV antigens. 

The nucleotide sequence construct with the formula (III) 

5 (III) Pl-S4g5ClarS650-720EcoRrP2-Si265Xhon S<|465Pstr P4gp150 

(visualised in figure 5) (SEQ ID NO: 69) codes for the amino acid sequence of gp150 (SEQ 
ID NO: 70). This amino acid sequence contains all of the envelope protein gp160 except the 
c-terminal tyrosin containing internalisation signals in the intracellular part of gp41. The 
membrane bound surface expression of the amino acid sequence is thereby maintained and 
10 enhanced. Mimicking the organisation of the native epitope conformation may by expected, 
making this nucleotide sequence construct of particular advantage if the nucleotide 
sequence construct is used as a vaccine. 

The nucleotide sequence construct with the formula (IV) 

15 (IV) Pl-S495ClarS 6 50-720EcoRrP2-Si265Xhor Si 46 5Pstr P4gp160~ S2060Sacir P5 

(visualised in figure 6) (SEQ ID NO: 71) codes for the amino acid sequence of gp160 (SEQ 
ID NO: 72) i.e. the entire envelope. 

The nucleotide sequence construct designated comprises the nucleotide sequence 
20 encoding the amino acid sequence in the first variable region (V1) and the amino acid 
sequence in the second variable region (V2). In one embodiment of the invention the first 
variable region is surrounded by EcoRV and Pstl restriction enzyme sites, and the second 
variable region is surrounded by Pstl and Clal restriction enzyme sites but as stated above, 
the choice of restriction enzyme sites can alter. 

25 

The nucleotide sequence construct designated S 65 o-72oecori comprises the nucleotide 
sequence encoding the amino acid sequence in the third variable region (V3). In one 
embodiment of the present invention S 6 5o-72oecori is characterised by the restriction enzyme 
sites EcoRI and Xbal in the terminals. 

30 

The nucleotide sequence construct designated P 2 comprises the nucleotide sequence 
encoding the amino acid sequence of the fourth variable and constant region (V4 and C4). In 
one embodiment of the present invention the forth variable region is surrounded by Sacl and 
Xho\ restriction enzyme sites. 

35 
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The nucleotide sequence construct designated S 12 65 gP i2o comprises the nucleotide sequence 
encoding amino acid sequence of the fifth variable and constant region (V5 and C5). 
Si 2 65gpi2o further comprises a nucleotide sequence encoding a C-terminal stop codon. 

5 The nucleotide sequence construct designated P 4gp i4o comprises the nucleotide sequence 
encoding amino acid sequence of the transmembrane spanning region. P 4 gpi 4 o further 
comprises a nucleotide sequence encoding a C-terminal stop codon prior to the 
transmembrane spanning region. 

10 The nucleotide sequence construct designated P 4gp i6o comprises the nucleotide sequence 
encoding amino acid sequence of the transmembrane spanning region (trans membrane 
spanning domain: TMD). In a preferred embodiment of the present invention the 
transmembrane spanning region is surrounded by Hindlll and Sacll restriction enzyme sites. 

15 The term "sequence identity" indicates the degree of identity between two amino acid 

sequences or between two nucleotide sequences calculated by the Wilbur-Lipman alignment 
method (Wilbur et al, 1983). 

The nucleotide sequence constructs with the formula (I), (II), (III), or (IV) illustrates the 
20 flexibility in the present invention. By producing a gene with the described method enables 
the production of a plethora of antigens with various immunogenic epitopes and various 
advantages for production and vaccine purposes. To further illustrate the flexibility of the 
invention, other changes and mutations are suggested below. 

25 In order to improve the immunogenicity of the nucleotide sequence constructs of the 
invention it is suggested to change the nucleotide sequence such that one or more 
glycosylation sites are removed in the amino acid sequence. By removal of shielding 
glycosylates, epitopes are revealed to the immunesystem of the mammal rendering the 
construct more immunogenic. The increased immunogenicity can be determined by an 

30 improved virus neutralisation. Changes in the nucleotide sequence such that one or more N- 
linked glycosylation sites are removed in the amino acid sequence is well known by the 
person skilled in the art. Potential glycosylation sites are N in the amino acid sequences N-X- 
T or N-X-S (wherein X is any amino acid besides P). The glycosylation site can be removed 
by changing N to any amino acid, changing X to a P, or changing T to any amino acid. It is 

35 preferred that N is changed to Q by an A to C mutation at the first nucleotide in the codon, 
and a C to G mutation at the third nucleotide in the codon. This is preferred to increase the 
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GC content in the nucleotide sequence construct. As an alternative N is changed to Q by an 
A to C mutation at the first nucleotide in the codon, and a C to A mutation at the third 
nucleotide in the codon. Preferred mutations in the synthetic BX08 envelope gene to remove 
potential N-linked glycosylation sites in V1 and/or V2 are A307C + C309A and/or A325C + 
5 C327G and/or A340C + C342A and/or A385C + C387A and/or A469C + C471A. Examples 
of such changes is illustrated in SEQ ID NOs: 47, 49, and 51. 

For historical reasons the HIVs have been divided into syncytia inducing strains and non 
syncytia inducing strains. The assay to determine whether a strain is syncytia inducing is 

10 described in Verrier et al 1997, hereby incorporated by reference. It is presently known, that 
viruses utilising the CXCR4 co-receptor are syncytia inducing strains. It is also, at the 
present, known that the binding site for the CXCR4 involves the third variable region (V3). In 
a preferred embodiment the nucleotide sequence construct is changed to create a binding 
site for the CXCR4 co-receptor. It is presently performed in the third variable regions, 

1 5 preferably by the mutation G865C + A866G. 

It is well established that the HIV envelope comprises immunodominant epitopes. An 
immunodominant epitope is an epitope that most antibodies from the mammal are directed 
against. The antibodies directed against these immunodominant epitopes may have little 

20 effect in elimination of the virus. It is therefore anticipated that modification of the 

immunodominant epitopes will induce antibodies directed against other parts of the envelope 
leading to a better elimination and neutralisation of the virus. By modification is understood 
any change in the nucleotide sequence encoding an immunodominant epitope in the amino 
acid sequence such that said amino acid sequence no longer contains an immunodominant 

25 epitope. Thus, modification includes removal of the immunodominant epitope and decrease 
of immunogenicity performed by mutagenesis. In a preferred embodiment of the present 
invention an immunodominant epitope in the third variable region (V3) is modified, such as 
deleted or altered. In a much preferred embodiment the nucleotides 793-897 are deleted. 
In yet another preferred embodiment of the present invention an immunodominant epitope 

30 has been removed from gp41 , such as deleted. This is performed in P 7 or P 8 by elimination 
of the nucleotides 1654-1710. 

It is anticipated that when gp120 is dissociated from gp41 in a vaccine or antigen, two 
immunodominant epitopes, one on each protein, are exposed and antibodies are directed 
35 against these in the mammal. In the infectious virus, gp120 is coiled on top of gp41 and the 
gp120/gp41 is most likely organised in a trimer, so that these immunodominant epitopes are 
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hidden and therefore less elimination of virus is observed. By removing the cleavage site 
between gp41 and gp120 a full length gp160, gp150, or gp140 can be obtained with a 
covalent binding between gp41 and gp120. Removal of the cleavage site between gp41 and 
gp120 is preferably performed by a mutation at position C1423A. An example of such a 
5 mutation is illustrated in the mutation of Si 26 5Xhoi (SEQ ID NO: 17) to S 12 65g P i6o (SEQ ID NO: 
21). 

In order to stabilise the full length gp160, gp150, and gp140 for example when the cleavage 
site between gp41 and gp120 has been removed as described above, cysteins can be 
10 inserted, preferably inside the gp41 helix creating disulphide bounds to stabilise a trimer of 
gp41s. In a preferred embodiment of the present invention the cysteins are inserted by the 
mutation 1618:CTCCAGGC:1625 to 1618:TGCTGCGG:1625. An example of such a change 
is illustrated in SEQ ID NO: 25. 

15 The above mentioned decrease in immunodominant epitopes combined with the increase in 
immunogenicity of the other epitopes is expected to greatly enhance the efficacy of the 
nucleotide sequence construct as a vaccine. 

During the production of the nucleotide sequence construct, it is convenient to ligate the 
20 snuts into pieces. The pieces, as described above, are characterised by their reversible 
assembly as there are no duplicate restriction enzyme sites. In a preferred embodiment one 
piece (herein designated P 3 ) contains P 1f S495ciai»S65o-720EcoRi> and P2. Another piece (herein 
designated P 8 ) contains S 12 66Xhoi. S 146 5Psti> and P 4gp i6o- Yet another piece (herein designated 

P7) Contains S 12 65Xhoi, S 146 5Pstli P4gp160, S2060Sacll, ^d P 5 . 

25 

One advantage of the present nucleotide sequence construct is the easy access to 
exchange and alterations in the content and function of the nucleotide sequence and the 
encoded amino acid sequence. In one embodiment the nucleotide sequence coding for a 
functional region or parts thereof of the amino acid sequence is repeated. The repeat could 
30 be back-to-back or a functional region or parts thereof could be repeated somewhere else in 
the sequence. Repeated could mean two (one repetition) but could also be three, six, or nine 
repeats. In a much preferred embodiment the repetition nucleotide sequence codes for 
amino acids in the third variable region. 



35 



In order to improve the protective capabilities of the invention against infections with HIV, 
one embodiment of the invention relates to the combination of epitopes. The present 
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nucleotide sequence construct allows insertion of one or more new nucleotide sequences 
isolated from another group and/or subtype of HIV and/or isolated from another patient. 
Hereby a vaccine or antigen with two or more epitopes from two or more HIVs is obtained. In 
a preferred embodiment, the V3 is replaced by the new nucleotide sequence. In a much 
5 preferred embodiment, the new nucleotide sequence codes for amino acids in the third 
variable region of a different HIV isolate. 

In order to improve the efficacy of the vaccine, aiming at raising cellular immunity, a 
nucleotide sequence coding for a T-helper cell epitope is included in the nucleotide 

10 sequence construct. The nucleotide sequence coding for a T-helper cell epitope or a T- 

helper cell epitope containing amino acid sequence can be put in anywhere in the nucleotide 
sequence construct as long as it does not interact with the function of the envelope molecule. 
However, it is preferably placed in the tail of the nucleotide sequence construct or between 
the leader sequence and the envelope gene. The T-helper epitopes are preferably selected 

15 from core proteins such as P24gag or from a non-HIV pathogen such as virus, bacteria, e.g. 
BCG antigen 85. For a therapeutic vaccine an HIV helper epitope is preferred since the 
patient is already primed by the HIV infection. For a prophylactic vaccine, a T-helper cell 
epitope from a frequently occurring non HIV pathogen such as Hepatitis B, BCG, CMV, EBV 
is preferred. Also, since the synthetic BX08 envelope genes may contain T-helper cell 

20 epitopes in addition to important antibody epitopes, the synthetic BX08 vaccine genes can be 
mixed with other DNA vaccines to improve the efficacy of the other DNA vaccine. 

One aspect of the present invention relates to individualised immunotherapy, wherein the 
virus from a newly diagnosed patient is directly cloned, the envelope or subunits 

25 corresponding to snuts or pieces is produced with highly expressed codons, inserted into any 
of the nucleotide sequence constructs described above and administered to the patient as a 
vaccine. Hereby a therapeutic DNA vaccine is obtained, that will help the patient to break 
immunetolerance or induce/reinduce an appropriate immune response. In one embodiment 
the variable regions of the virus are produced with highly expressed codons and exchanged 

30 into any of the nucleotide sequence constructs described above. 

In one embodiment of the invention, the nucleotide sequence construct as described above 
satisfies at least one of the following criteria: 

a) serum extracted from a Macaque primate which has been immunised by administration of 
35 an expression vector containing the nucleotide sequence construct is capable of eliminating 
SHIV as determined by quantitative PCR and/or virus culturing. 
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b) serum extracted from a primate which has been immunised by administration of an 
expression vector containing the nucleotide sequence construct is capable of neutralising 
HIV-1 BX08 and /or other HIV-1 strains in vitro. 

c) serum, extracted from a mouse which has been immunised by administration of an 

5 expression vector containing the nucleotide sequence construct four times in intervals of 
three weeks and boosted after 15 weeks, is capable of decreasing the concentration of HIV- 
antigen in a culture of HIV, serum or PBMCs by at least 50%. An example of such procudure 
is shown in example 9. 

10 In one embodiment of the invention, the nucleotide sequence construct of the invention, is 
used in medicine. That is, it is used as a vaccine, for the production of a recombinant protein, 
such that the recombinant protein is used as a vaccine, or the nucleotide sequence construct 
or the recombinant protein is used in a diagnostic composition. 

Thus, the nucleotide sequence construct of the invention can be used for the manufacture of 
15 a vaccine for the prophylactics of infection with HIV in humans. 

Intramuscular inoculation of nucleotide constructs, i.e. DNA plasmids encoding proteins have 
been shown to result in the generation of the encoded protein in situ in muscle cells and 
dendritic cells. By using cDNA plasmids encoding viral proteins, both antibody and CTL 

20 responses were generated, providing homologous and heterologous protection against 
subsequent challenge with either the homologous or cross-strain reaction, respectively. 
The standard techniques of molecular biology for preparing and purifying DNA constructs 
enable the preparation of the DNA therapeutics of this invention. While standard techniques 
of molecular biology are therefore sufficient for the production of the products of this 

25 invention, the specific constructs disclosed herein provide novel therapeutics which can 
produce cross-strain protection, a result heretofore unattainable with standard inactivated 
whole virus or subunit protein vaccines. 

The amount of expressible DNA to be introduced to a vaccine recipient will depend on the 
strength of the transcription and translation promoters used in the DNA construct, and on the 

30 immunogenicity of the expressed gene product. In general, an immunologically or 

prophylactically effective dose of about 10 jug to 300 jig is administered directly into muscle 
tissue. Subcutaneous injection, intradermal introduction, impression through the skin, 
inoculation by gene gun preferably DNA coated gold particles, and other modes of 
administration such as intraperitoneal, intravenous, peroral, topic, vaginal, rectal, intranasal 

35 or by inhalation delivery are also contemplated. It is also contemplated that booster 
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vaccinations are to be provided. It is further contemplated that booster vaccinations with 
recombinant antigens are to be provided, administered as described above. 

The DNA may be naked, that is, unassociated with any proteins, adjuvants or other agents 
5 which impact on the recipients immune system. In this case, it is desirable for the DNA to be 
in a physiologically acceptable solution, such as, but not limited to, sterile saline or sterile 
buffered saline. Alternatively, the DNA may be associated with surfactants, liposomes, such 
as lecithin liposomes or other liposomes, such as ISCOMs, known in the art, as a DNA- 
liposome mixture, (see for example WO93/24640) or the DNA may be associated with and 
10 adjuvant known in the art to boost immune responses, such as a protein or other carrier. 
Agents which assist in the cellular uptake of DNA, such as, but not limited to, calcium ions, 
detergents, viral proteins and other transfection facilitating agents may also be used to 
advantage. These agents are generally referred to as transfection facilitating agents and as 
pharmaceutical^ acceptable carriers. 

15 

Those skilled in the field of molecular biology will understand that any of a wide variety of 
expression systems may be used. A wide range of suitable mammalian cells are available 
from a wide range of sources (e.g. the American Type Culture Collection, Rockland, Dm; 
also, see e.g. Ausubel et al. 1992). The method of transformation or transfection and the 
20 choice of expression vehicle will depend on the host system selected. Transformation and 
transfection methods are described e.g. in Ausubel et al 1992; expression vehicles may be 
chosen from those provided e.g. in P.H. Pouwels et al. 1985. 

In one embodiment of the present invention the protein encoded by the nucleotide sequence 
25 construct is produced by introduction into a suitable mammalian cell to create a stably- 
transfected mammalian cell line capable of producing the recombinant protein. A number of 
vectors suitable for stable transfection of mammalian cells are available to the public e.g. in 
Cloning Vectors: A Laboratory manual (P.H. Pouwels et al. 1985); methods for constructing 
such cell lines are also publicly available, e.g. in Ausubel et al. 1992. 

30 

Standard reference works describing the general principles of recombinant DNA technology 
include Watson, J.D. et al 1987; Darnell, J.E. et al 1986; Old, R.W. et al, 1981; Maniatis,T. et 
al 1989; and Ausubel et al.1992. 
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Figure legends 

The invention is further illustrated in the following non-limiting examples and the drawing 
wherein 

5 Figure 1 provides the codon preference of highly expressed proteins in human cells. 

Figure 2 illustrates the outline of gp120, gp140, gp150, and gp160 encoding synthetic genes 
derived from the wild type sequence at the top. Variable (V) and constant (C) regions are 
shown together with the leader peptide (LP) and the transmembrane spanning domain 
10 (TMD). The approximate nucleotide positions of the restriction enzyme sites are shown. 
The approximate position of the three restriction enzyme sites dividing the full-length 
gp160 gene into the three pieces each containing only unique restriction enzyme sites are 
shown in bold. 

15 Figure 3 building of the synthetic gp120 gene. Variable (V) and constant (C) regions are 

shown together with the leader peptide (LP) and the transmembrane spanning domain ( 
TMD). The approximate nucleotide positions of the restriction enzyme sites are shown. 

Figure 4 building of the synthetic gp140 gene. Variable (V) and constant (C) regions are 
20 shown together with the leader peptide (LP) and the transmembrane spanning domain ( 
TMD). The approximate nucleotide positions of the restriction enzyme sites are shown. 

Figure 5 building of the synthetic gp150 gene. Variable (V) and constant (C) regions are 
shown together with the leader peptide (LP) and the transmembrane spanning domain ( 
25 TMD). The approximate nucleotide positions of the restriction enzyme sites are shown. 

Figure 6 building of the synthetic gp160 gene. Variable (V) and constant (C) regions are 
shown together with the leader peptide (LP) and the transmembrane spanning domain ( 
TMD). The approximate nucleotide positions of the restriction enzyme sites are shown. 

30 

Figure 7 illustrates the codons coding amino acids in general 
Figure 8 illustrates how overlapping PCR is performed. 
35 Figure 9 illustrates how PCR using conserved flanking ends is performed. 
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Figure 10 illustrates how S 126 5Xhoi is produced using complementary strands (minigene- 
approach) technology. The S 1265X hoi is ligated from three sets of complementary strands 
into the vector pBluescript KS + between restriction enzyme sites Xho\ and Pst\. 

5 

Figure 1 1 illustrates how S 1465 psti is produced. The same approach, as the approach used for 
the production of S 1265 xhoi, was used except that only two sets of complementary strands 
were used. 

10 Figure 12 illustrates the assembly of Pl The S 0 -N-Lan g and S 23 5ecorv are ligated into the Xba\ 
and Pst\ site of the S 37 5Psti containing plasmid. 

Figure 13 illustrates the assembly of P 2 . The S 900 xbai was excerted by Hind\\\ and Sad from 
its plasmid and ligated with S 99 osaci (Sac\-Spe\) into the S 110S pai plasmid that was opened 
1 5 at the HindWl and Spel sites. 

Figure 14 illustrates the assembly of P 3 . S 49 5ciai (C/al-EcoRI) and S 65 o-72oecori (EcoR\-Xba\) 
and P 2 (Xba\-Xho\) were ligated simultaneously into the P^ plasmid opened at the C/al 
and Xho\ sites to obtain the P 3 plasmid. 

20 

Figure 15 illustrates the assembly of P 4gp i6o. S 18 90Hindiii (Sacl-/-//>?dlll) and S 170 oeagi (Hind\\\- 
Eag\) were ligated simultaneously into the S 1630 xhai plasmid opened by Sad! and Eagl. 

Figure 16 illustrates the assembly of P 5 . S 2190 ciai (Cla\-Pst\) and S 233 opsti (Pst\-EcoR\) were 
25 ligated into the S 2425 e s plasmid opened by C/al and EcoRI. 

Figure 17 illustrates the assembly of P 8gp i6o- S 1465P sti (Xba\-Pst\) and S 1265 xhoi (Pst\-Xho\) were 
ligated into the P 4gp16 o plasmid opened by Xba\ and Xho\. 

30 Figure 18 illustrates the assembly of P 8gp i5o. S 1465 p s ti (Xba\-Pst\) and S 1265X hoi (Pst\-Xho\) were 
ligated into the plasmid containing P 4gp i 5 o with the stop codon. P 4gp i 5 o plasmid was opened 
at the Xba\ and Xho\ sites for the ligation. 

Figure 19 illustrates the assembly of P 8gp i 40 . S 146 5Psti (Xba\-Pst\) and S 1265 xhoi (Pst\-Xhol) were 
35 ligated into the plasmid containing P 4gp i 40 with a stop codon. P 4gp140 plasmid was opened 
at the Xba\ and Xho\ sites for the ligation. 
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Figure 20 illustrates the assembly of P 8gP 4i. Two complementary nucleotide strands 
1265gp41S and 1265gp41AS designed with overhang creating a 5' Xho\ and a 3' Pst\ 
restriction enzyme site were anealed and iigated into the piece 8 which is already opened 
5 at the Xho\ and Pst\ sites whereby S 12 65 is deleted. 

Figure 21 illustrates the assembly of P 7 . Ps (Xhol-SacU) and S 2 o6osacn (Sacll-C/al) were 
Iigated into P 5 plasmid opened atXhol and C/al. 

10 Figure 22a SDS PAGE of 35 S-labelled HIV-1 BX08 envelope glycoproteins radio-immuno 

precipitated from transiently transfected 293 cells using the indicated plasmids. Cell pellet 
(membrane bound antigens) or cell supernatant (secreted antigens) were precipitated by 
a polyclonal anti-HIV-1 antibody pool. Lane 1: untransfected cells. Lane 2: supernatant 
from syn.gp120 M N transfected cells. Lane 3: cell pellet from wt.gp160 B xos transfected cells. 

15 Lane 4: cell pellet from cells co-transfected by wt.gp160 B xo8 and pRev. Lane 5: Mwt. 
marker. Lane 6: cell pellet from syn.gp160 B xo8 transfected 293 cells. Lane 7: cell pellet 
from syn.gp150 B xos transfected 293 cells. Lane 8: supernatant from syn.gp140 B xo8 
transfected cells. Lane 9: supernatant from syn.gp120 B xo8 transfected cells. 

20 Figure 22b is an SDS-PAGE of 35 S-labeled HIV-1 BX08 envelope glycoproteins radio- 
immune precipitated from transiently transfected 293 cells as cell pellet (membrane 
bound) or cell supernatant (secreted antigens) by anti-HIV-1 antibody pool using the 
indicated plasmids. Lane 1: untransfected 293 cells. Lane 2: cell pellet from syn.gp160MN 
transfected 293 cells as positive control (Vinner et al 1999). Lane 3: Cell supernatant from 

25 syn.gp120MN transfected 293 cells as positive control (Vinner et al 1999). Lane 4: Cell 
supernatant from syn.gp120BX08 transfected 293 cells demonstrating a glycoprotein 
band of 120 kDa. Lane 5: Cell supernatant from syn.gp140BX08 transfected 293 cells 
demonstrating a glycoprotein band of 120 kDa. Lane 6: Mwt. marker. Lane7 at two 
different exposure times: Cell pellet from syn.gp150BX08 transfected 293 cells 

30 demonstrating a glycoprotein band of 120 kDa (lower gp30 band is not well seen in this 
exposure). Lane 8; Cell supernatant from syn.gp150BX08 transfected 293 cells showing 
no secreted proteins (all protein is membrane bound, see lane 7). 

Figure 22c show fluorescent microscopy of U87.CD4.CCR5 cells transfected with BX08 
35 gp160 genes plus pGFP. Panel A: cells transfected with empty WRG7079 vector plus 
pGFP showing no syncytia. Panel B: cells transfected with wild type BX08gp160 gene 
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plus pGFP showing some syncytia. Panel C: cells transfected with synBX08gp160 plus 
pGFP showing extreeme degree of syncytia formation. This demonstrates expression, 
functionality, and tropism of the expressed BX08 glycoprotein with much more expressed 
functionally active gp160 from the synthetic BX08 gene. 

5 

Figure 23 shows the anti-Env-V3 BX08 antibody titers (lgG1). Panels show individual mice 
DNA immunized with syn.gp140BX08 plasmid either i.m. (left panel) or by gene gun (right 
panel), respectively. Immunization time points are indicated by arrows. 

10 Figure 24 shows a Western Blotting of (from left to right) one control strip, followed by sere 
(1:50) from 2 mice i.m. immunized with synBX08gp120, 2 mice i.m. immunized with 
synBX08gp140, 2 mice i.m. immunized with synBX08gp150, and 2 mice immunized with 
synBX08gp160, followed by 2 mice gene gun immunized with synBX08gp120, 2 mice 
gene gun immunized with synBX08gp140, 2 mice gene gun immunized with syn 

15 BX08gp150, and 2 mice gene gun immunized with synBX08gp160 respectively. Strip 5 is 
a mouse 5.1 DNA immunized i.m. with synBX08gp140 plasmid (same mouse as in figure 
23). Plasma was examined at week 18. The positing of gp160 (spiked with four coupled 
gp51), gp120 and gp41 is indicated at the right. A positive reaction to HIV glycoproteins 
futher demonstrates the mouse anti-HIV immunoglobulin reacting to HIV of a strain (NIB) 

20 different from BX08 to illustrate cross-strain reactivity. 

Figure 25 Theoretical example of calculation of the 50% inhibitory concentration (IC 50 ) 

values. IC 50 for each mouse serum is determined by interpolation from the plots of percent 
inhibition versus the dilution of serum. 

25 

Figure 26 CTL responses were measured at week 18 to the mouse H-2D d restricted BX08 
V3 CTL epitope (IGPGGRAFYTT) for BALB/c mice (H-2D d ) i.m. immunized at week 0, 9, 
and 15 with the synthetic vaccine genes: syn.gp120 BX0 8, syn.gp140 BX0 8, syn.gp150 B xo8, 
and syn.gp160 BX 08, respectively, and median values of different E:T ratios for groups of 
30 mice are shown (26A). Intramuscular DNA immunization with syn.gp150 B xos induced a 
higher CTL reponse when injected i.m. in high amounts versus gene gun inoculation of 
skin (26B). 

Figure 27 Summary of western immuno blotting assay of mice sera (1:40) collected at week 
35 0, 9, and 18 from mice genetically immunized with syn.gp120 BX0 8, syn.gp140 BX0 8, 
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syn.gp150 BX o8, syn.gp160 BX 08, wtgp160 BX08 , and wt.gp160 BX0 8 plus pRev, respectively. 
Percent responders in groups of 17-25 mice against gp120 and gp41 are shown. 

Figure 28 IgG anti-rgp120 (IIIB) antibody titers of individual mice inoculated at week 0, 9, 15 
(28A), or gene gun immunized at week 3, 6, 9, and 15 (28B) with the syn.gp150 B xo 8 DNA 
vaccine. 

Figure 29 IgG antibody titers to HIV-1 rgp120 mB . Median titers are shown from groups of 
mice i.m. inoculated at week 0, 9, and 15 (29A), or gene gun immunized at week 0, 3, 6, 
9, and 15 (29B) with the synthetic genes syn.gp120 BX08 , syn.gp140 BX08} syn.gp150 BX08 , 
and syn.gp160 BX 08, respectively. 
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Examples 

Example 1: Designing the nucleotide sequence construct 

Initially the overall layout of the nucleotide sequence construct is decided. The overall layout 
comprises the various derivatives the gene will be expressed as. For BX08 these include, but 
5 are not restricted to gp160, gp150, gp140, gp120, and gp41. 

Next, the vehicle of expression (plasmid or virus) is to be determined: Preparation for a 
suitable vector determines both need for leader sequence, terminal restriction enzyme sites 
and whether or not an N- or C-terminal protein tag is to be considered (Poly-his, Myc- 
antibody-epitop, etc.). For BX08 a plasmid expression vehicle was chosen. All native wild 

10 type HIV codons are systematically exchanged with the codons most frequently represented 
in a pool of highly expressed human genes (figure 1). By this exchange the amino acid 
sequence is conserved while the nucleotide sequence is dramatically altered. Thus, gene 
structures like overlapping reading frames (e.g. vpu, rev, and tat) or secondary structures 
(e.g. RRE) are most likely destroyed whereas protein cleavage sites, and glycosylation sites 

15 are maintained. The 100% amino acid identity between wtBX08 and synthetic BX08 in the 
present examples should be calculated after the initial Ala-Ser amino acid sequence, as that 
sequence is a part of the 6 amino acid sequence long Nhe\ restriction enzyme site. 

Depending on the restriction enzyme sites located in the expression vector it is decided 
20 which restriction enzyme sites can be present (tolerated) throughout the finished gene 

construct. The terminal restriction enzyme sites of the synthetic gene must remain unique to 
enable cloning into the vector chosen for expression. General requirements for restriction 
enzyme sites of choice: Preferably creating cohesive ends facilitating ligation, creating no 
compatible ends with adjacent restriction enzyme sites (e.g. BamHUBglU), and being efficient 
25 cutters. For BX08 the restriction enzyme sites accepted were the ones present in the 
polylinker of the pBluescript cloning vectors (Eagl, Mlu\, EcoRV, Pst\, C/al, EcoRI, Xbal, 
Sad, Spel, Xho\, Hind\\\, Sacll, Not\, BamH\, Sma\, Sa/I, Oral, Kpn\ with the exception of 
BglW and Nhe\). This was decided to satisfy the original cloning strategy using individual 
cloning of snuts in pBluescript with restriction enzyme cleaved (trimmed) ends after PCR 
30 amplification, which is not necessary when blunt-end cloning and assembling of 

complementary oligonucleotides are employed. All locations at which the selected restriction 
enzyme sites can be introduced by silent mutations (keeping 100% loyal to the amino acid 
sequence) are identified using the SILMUT software or equivalent. 
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From these possible restriction enzyme sites, a selection of restriction enzyme sites are 
introduced by silent nucleotide substitutions around functional regions of choice of the 
corresponding gene (e.g. RRE) or gene products (e.g. variable region 1 (V1), V2, V3, CD4 
binding area, transmembrane domain, and regions of immunological significans, etc.). 
5 Restriction enzyme sites are located at terminal positions of subcloned snuts (building 

entities) but additional restriction enzyme sites may be present within subunits. For BX08 the 
construct was initially to be cloned in the WRG7079 vector containing a tPA-leader 
sequence. Cloning sites were 5'-Nhe\ BamHl-3\ The entire humanised BX08 sequence 
was divided into thirds: 5'-Nhe\ -» Xho\ Sad I San?HI-3 5 . These sites were chosen in 
10 this particular order because it resembles the polylinker of pBluescript (KS~) enabling 
successive ligations of the assembled thirds in this cloning vector. Within these thirds 
restriction enzyme sites were kept unique. Next, restriction enzyme sites were placed to flank 
the functional regions chosen as follows: 

A. (5-V1): EcoRV-235: Between C1 and V1. Alternatives: 3xH/rtGflll (already excluded 
15 because exclusive use at position 1890) or EcoRV. 

B. (V1-3'): Only alternative Pst\ 375. 

C. (5'-V2): as B. 

D. (V2-3'): Alternatives: Spel, C/al 495. C/al chosen because it is closer to V2. 

E. EcoRI 650 placed because next possible site was too far away. 

20 F. (5-V3): BglW 720 was the alternative closest to the V3 region and further more unique. 

G. (V3-3'): Alternatives Xho\ (excluded) and Xbal 900 located very close to the V3 loop. 

H. (5'-V4) Sad 990: alternatively EcoRI or BamH\ (both excluded) 

I. (V4-3'): Alternatives Spel 1110, Kpn\ 1145, Pst\ 1135. Pst\ already used, Spel chosen 
because of distance to previous site (Sad 990). 

25 J. (5-V5): Xho\ initially determined. 

K. (Fusion peptide-3') Pst\ 1465 was the closest alternative to Xho\ 1265. 

L. (S'-lmmunodominant region): Xbal 1630 chosen among EcoRV (blunt end), Pst\ and Xho\ 

(both already used). 

M. (Immunodominant region-3'): Eagl 1700 perfect location. 
30 N. (C34 and C43 -3' (Chan, Fass, et al. 1997), and 5'-trans membrane domain): Sad I. No 
alternatives. 

O. (Trans membrane domain -3'): Sadl 2060 already present. 
P. C/al 2190 perfect position in relation to previous RE-site. 
Q: Pstl perfect position in relation to previous RE-site. 
35 R. EcoRI 2400 introduced to facilitate later substitution of terminal snut. 
S. eamHI 2454 determined by the WRG7079 vector. 
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Remove undesired restriction enzyme sites by nucleotide substitutions (keeping loyal to the 
amino acid sequence). Nucleotide substitution should preferably create codon frequently 
used in highly expressed human genes (figure 1). If that is not possible, the codons should 
5 be the selected from the regular codons (figure 7). The substitutions made to the second 
nucleotide sequence to obtain desired restriction enzyme sites are shown in Table 2. 



Table 2 lists silent nucleotide substitutions in the humanised BX08 envelope 
10 sequence. Substitutions were made to create or delete restriction enzyme sites. 
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rscinarKS. 
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uit?ait;o iviiu i oiie on pos. io*f- loy 
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C — > I 


Ucdlcb dOUr\ V Sllc Ufl pub. ^O0 - ^*rO 


\J\J 1 


C — > 3 


creaxes oia i sue on pos. ou i-ouo 




a — > i 


uu 




g — > c 


UU 


504 


c — > g 


Hn 


657 


c — > a 
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C — r I 


Hn 
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c — > a 


froafoe Rnl II c i+o r\r» r»ne 704 "7QQ 

(/icaico ogi n oiie on pos. /^*t-/^y 


726 


c — > y 


do 


727 


a — > I 


Hn 

UU 


728 


9 — > c 


Hn 

UU 


729 


O — > I 


Hn 

UU 


840 


r — v t 


I — ay I ol ic io ~ 1 1 1 1 ill la icu 


904 


a — t i 


Wl CCJICO AUa I Ol IC Ul 1 pUO. C7U*t 5/vv 


905 


y — > u 


do 


906 


r — v t 


do 


907 


c — > a 


Hn 

UU 


909 


c — > a 


do 


994 


a -> t 


creates Sac 1 site on pos. 990-995 


995 


g -> c 


do 


1116 


c->t 


creates Spel site on pos. 1 1 14-1 119 


1119 


c-> t 


do 


1273 


a->t 


creates Xhol site on pos. 1272-1277 


1274 


g -> c 


do 


1275 


c g 


do 


1293 


c-»t 


Bgl II site is eliminated 


1443 


c->t 


BstXI site is eliminated 


1452 


g->c 


do 


1467 


c->t 


Pstl site on pos. 1466-1471 


1470 


c ^ a 


do 


1590 


g -> c 


Pstl site on pos. 1588-1593 is eliminated 


1620 


g -> c 


Pstl site on pos. 1618-1623 is eliminated 


1638 


c^t 


creates Xbal site on pos. 1638-1643 


1641 


g -> a 


do 


1653 


g -> c 


Pstl site is eliminated 


1687 


a->t 


Pstl site is eliminated 


1688 


g-> c 


Pstl site is eliminated 


1710 


c -> g 


creates Eagl site on pos. 1709-1714 


1758 


c->t 


Bgl II site is eliminated 


1875 


g -> c 


Pstl site is eliminated 


1893 


c-> a 


Hind III on pos. 1893-1898 


1897 


c-> t 


do 


1944 


c-> t 


Bgl II site is eliminated 
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Position: 


substitution 


Remarks: 


2199 


c-*t 


Cla I site on pos. 2198-2203 


2202 


c -» t 


do 


2203 


c->t 


do 


2253 


c-> g 


Sacll site is eliminated 


2292 


g -> c 


Pstl site is eliminated 


2320 


a->t 


Pstl site on pos. 2321-2326 is eliminated 


2321 


a->t 


do 


2322 


g->t 


do 


2325 


c-> a 


do 


2430 


c-> a 


creates EcoRI site on pos. 2429-2434 


2433 


c-» t 


do 



Example 2: synthesis of oligos 

In order to clone the individual snuts, nucleotide strands were synthesised or purchased. In 
total 28 synthetic nucleotide strands were synthesised. Nucleotide strands were synthesised 
5 by standard 0.2 jumol p-cyanoethyl-phosphoramidite chemistry on an Applied Biosystems 
DNA synthesiser model 392, employing 2000 A CPG columns (Cruachem, Glasgow, 
Scotland), acetonitrile containing less than 0.001% water (Labscan, Dublin, Ireland) and 
standard DNA-synthesis chemicals from Cruachem, including phosphoramidites at 0.1 M 
and Tetrahydrofuran/N-methylimidazole as cap B solution. The nucleotide strands O-N-C 

10 and 1 19MS-RC (for cloning of snut O-N-Lang), 650-E-BG and 720-XBAC-31 (for cloning of 
snut 650-720-EcoRI), 2425esup and 2425ESdo (for cloning of snut 2425-E-S) were 
synthesised with 5' end "trityl on" and purified on "Oligonucleotide Purification Cartridges" 
(Perkin Elmer, CA, USA) as described by the manufacturer. Other nucleotide strands (235- 
ECOS, 375-pst1.seq, 495-Cla1.seq,900-Xbal, 990-sad, 1110-SPE, 1630-Xba.seq, 1700- 

15 Eag.seq, 17-Eag.seq, 1890-Hind.MPD, 2060-sac, 2190-cla, 2330-pst) were synthesised with 
5' end "trityl off' and purified by standard ethanol precipitation. Oligoes 1265-1 UP, 1265- 
1DO, 1265-2UP, 1265-2DO, 1265-3UP, 1265-3DO, 1465-1UP, 1465-1DO, 1465-2UP, 1465- 
2DO were purchased from Pharmacia. 

Example 3: Cloning of snuts 

20 The nucleotide sequence construct was designed in 17 DNA small pieces called snuts 

(Table 3) encompassing important structures like variable and constant regions each flanked 
with restriction enzyme (RE) sites to facilitate cassette exchange within each third of the 
gene: Nhe\-Xho\, Xho\-Sac\\, Sacll-SamHI. 

25 Each snut was cloned individually in a commercial vector (pBluescriptKS or pMOSblue) and 
kept as individual DNA plasmids, named after the snut which gives the nucleotide position of 
the RE in the BX08. 
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Table 3 list the snuts by their name and cloning vector. 



Name 


Cloning vector: 


So-N-Lang 


pMOSblue 


S235EcoRV 


pMOSblue 


S375Pstl 


pBluescriptSK 


^>495Clal 


pMOSblue 


S650-720ECORI 


pMOSblue 


SgooXbal 


pMOSblue 


S990Sacl 


pMOSblue 


^mospel 


pMOSblue 


^1265Xhol 


pBluescriptSK 


Si465Pstl 


pBluescriptSK 


Sl630Xbal 


pBluescriptSK 


SuoOEagl 


pBluescriptSK 


Si890Hindlll 


pBluescriptSK 


S2060Sacll 


pMOSblue 


S2l90Clal 


pMOSblue 


S2330Pstl 


pMOSblue 


S2425ES 


pBluescriptSK 



Three principally different methods were used to obtain the dsDNA corresponding to each of 
5 the 17 snuts needed to build the synthetic BX08 genes. 

1) "Overlapping" PCR: is based on the use of two ssDNA template nucleotide strands 
(forward and reverse) that complement each other in their 3'-end (figure 8). During the first 
PCR cycle, both templates annealed to each other at the 3'-ends allowing the full-length 
10 polymerisation of each complementary strand during the elongation step. The newly 

polymerized dsDNA strand are then amplified during the following cycles using an adequate 
forward and reverse primers set (figure 8). 

Snut O-N-LANG: (S 0 -N-Lang) two ng of the forward template nucleotide strands O-N-C and 2 
15 ng of the reward template-nucleotide strand 1 1 9MS-RC were mixed together with 50pmoles 
of the forward primer O-N-LANG-5 (5'-CTAGCTA-GCGCGGCCGACCGCCT -3') and 
SOpmoles of the reverse primer O-N-LANG-3 (5-CTCGATATCCTCGTGCATCTGCTC -3') in 
a 100|xl PCR reaction volume containing 0.2mM dNTP's, 1x ExpandHF buffer with MgCI 2 
(1.5mM) and 2.6 units of enzyme mix (Expand™ High fidelity PCR system from Boehringer 
20 Mannheim). The PCR was performed with the PE Amp 9600 thermocycler (Perkin Elmer) 
using the following cycle conditions: initial denaturation at 94°C for 30 sec, followed by 30 
cycles of 94°C for 15 sec, 65°C for 30 sec, 72°C for 45 sec, with a final elongation at 72°C 
for 5 min., and cooling to 4°C. 
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Snut 650-720-EcoRI: (S 650 -72oecori) PCR amplification was performed as described for snut 
O-N-LANG. One jig of the forward ssDNA template-oiigonucleotide 650-E-BG and 1 jag of 
the reverse ssDNA template-oiigonucleotide 720-XBAC were mixed with 40pmoles of the 
forward primer 650-E-5 (5-CCGGAATT-CGCCCCGTGGTGAGCA-3') and 40 pmoles of the 
5 reverse primer 720-X-3 (5-CTGCTCTAGAGATGTTGCAGTGGGCCT-3'). 



2) "Normal" PCR amplification: Eleven nucleotide strands: 235-EC05, 375-pst1, 900-xba1, 
990-sad, 1110-SPE, 1630-XBA, 1700-EAG, 1890-HIN, 2060-sac, 2190-cla, and 2330-pst, 
were designed with common 5 J and 3' flanking sequences which allowed PCR amplification 

10 with the same primer set (Forward primer : BX08-5 (S'-AGCGGATAACAATTTCACACAGGA- 
3') and revers primer : BX08-3 (5'-CGCCAGGGTTTTCCCAGTCACGAC-3') (Figure 9). The 
495-Cla1 oligonucleotide was designed without a common flanking sequence and was 
therefore amplified with a specific set of primers 495-5N/495-3N ( 5'-GAATCGAT- 
CATCACCCAG-3* and 5'-GACGAATTCCGTGGGTGCACT-3'). Each oligonucleotide was 

15 resuspended in 1ml of water and kept as a stock solution (approximatively 0.2 mM). PCR 
amplification was performed with the Expand™ High Fidelity PCR System from Boehringer 
Mannheim (Cat. No. 1759078). Four concentrations of template nucleotide strand were 
systematically used: undiluted stock solution, stock solution 10~\ stock solution 10~ 2 , stock 
solution 10' 3 . One to 5 jJ of synthetic ssDNA template was amplified using the following 

20 conditions: BX08-5 (O.S^M), BX08-3 (O.SjuM), 4 dNTP's (0.2mM), 1x ExpandHF buffer with 
MgCI 2 (1.5mM) and 2.6 units of enzyme mix. The PCR was performed using the PE Amp 
9600 thermocycler (Perkin Elmer) using the following cycle conditions: initial denaturation at 
94°C for 15 sec, followed by 30 cycles of 94°C for 1 5 sec, 65°C for 30 sec and 72°C for 45 
sec, with a final elongation at 72°C for 7 min., and cooling to 4°C. 

25 

3) Minigene approach : This method was used to synthesise S 12 65Xhoi, S 14 65Xbai and S 242 ses. 
Snut 2425-E-S (S 2425 es): 100 picomoles of each oligonucleotide 2425ES-up (35-mer ; 5'- 
AATTCGCCAGGGCTTCGAGCGCGCCCTGCTGTAAG-3') and 2425ES-do (35-mer ; 
GATCCTTACAGCAGGGCGCGCTCGAAGC-CCTGGCG-3') were mixed together in a 100 ]x\ 

30 final volume of annealing buffer containing NaCI 25mM , Tris 10mM and 1mM EDTA. After 
denaturation at 94°C for 15 min M the mixed oligonucleotides were allowed to anneal at 65°C 
during 15min.. The annealing temparature was allowed to slowly decrease from the 65°C to 
room temperature (22°C) during overnight incubation. The resulting double-strand dsDNA 
fragments harbored EcoRI- and BamHI-restriction sites overhangs that allowed direct cloning 

35 in pBluescript KS(+) vector using standard cloning techniques (Maniatis 1996). 
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Snut 1265-Xhol (S 126 5Xhoi): This snut was built according to the strategy depicted in figure 10. 
Three minigenes were constructed following the same method described for snut 2425-E-S. 
These minigenes are named 1265-1, 1265-2 and 1265-3. The minigene 1265-1 results from 
the annealing of the oligonucleotides 1 265-1 up (68-mer ; 5'-tcg agc agc ggc aag gag att 

5 TTC CGC CCC GGC GGC GGC GAC ATGC GCG ACA ACT GGC GCA GCG AGC T-3') and 1 265-1 do (68- 
mer ; 5-GTA CAG CTC GCT GCG CCA GTT GTC GCG CAT GTC GCC GCC GCC GGG GCG G AAA ATC 

tcc ttg CCG ctg c-3'). 1265-2 results from the annealing of 1265-2up (61-mer ; 5'-GTA CAA 

GTA CAA GGT GGT GAA GAT CGA GCC CCT GGG CAT CGC CCC CAC CAA GGC CAA GCG C-3') and 
1 265-2d0 (63-mer ; 5'-CAC GCG GCG CTT GGC CTT GGT GGG GGC GAT GCC CAG GGG CTC GAT 

1 0 CTT cac CAC CTT gta CTT-3'). Finally, 1 265-3 results from the annealing of 1 265-3up (69-mer 

; 5'-CGC GTG GTG CAG CGC GAG AAG CGC GCC GTG GGC ATC GGC GCT ATG TTC CTC GGC TTC CTG 
GGC GCT GCA-3') and 1265-3d0 (59-mer ; 5'-GCG CCC AGG AAG CCG AGG AAC ATA GCG CCG 

atg ccc acg GCG cgc ttc tcg cgc tgc ac-3'). Each minigene were designed in order to 
present single strand overhangs at their 5' and 3'- ends that allow easy ligation and Xhol-Pstl 
15 direct cloning into pBlueScript KS+ vector. 

Snut 1465-Pstl (S 146 5 Ps ti): Two minigenes were constructed following the same methode 
described for snut 2425-E-S. These minigenes are named 1465-1 and 1465-2. The minigene 
1465-1 was obtained after annealing of 1465-1 up (90-mer : 5-GGC AGC ACC atg ggc GCC 

20 GCC AGC CTG ACC CTG ACC GTG CAG GCC CGC CAG CTG CTG AGC GGC ATC GTG CAG CAG CAG 

aac aac ctg ctg-3') and 1465-1 do (98-mer : 5'-cgc gca gca ggt tgt tct gct gct gca cga 

TGC CGC TCA GCA GCT GGC GGG CCT GCA CGG TCA GGG TCA GGC TGG CGG CGC CCA TGG TGC 

tgc ctg ca-3'), whereas minigene 1465-2 results from the annealing of 1465-2up (78-mer ; 

5-CGC GCC ATC GAG GCC CAG CAG CAC CTG CTC CAG CTGA CCG TGT GGG GCA TCA AGC AGC TCC 
25 AGG CCC GCG TGC TGG CT-3') and 1465-2dO (78-mer ; 5'-CTA GAG CCA GCA CGC GGG CCT GGA 
GCT GCT TGA TGC CCC ACA CGG TCA GCT GGA GCA GGT GCT GCT GGG CCT CGA TGG-3'. Each 

minigene were designed in order to present single strand overhangs at their 5' and 3'- ends 
that allow easy ligation and Pstl-Xbal direct cloning into pBlueScript KS+ vector using 
standard cloning techniques (Maniatis) (see figure 11). 

30 Example 4: assembly of snuts to pieces. 

The snut genes were then assembled into pieces (Table 4) so that unique restriction enzyme 
sites or mutagenesis can be used within each of these. This strategy will require fewer 
assemblings for optimal use of the cassette system. The following piece clones were made 
and kept individually for construction of the synBX08 gp160 gene (Figure 6): 
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Table 4 lists pieces by their name and their snut composition. 



Piece 


snut composition 


vector 


name 






Pi 


So-N-LANG"S235EcoR\rS375p s t| 


pBluescriptSK 


P 2 


SgooXbar S ggosacr S 1 1 <| 0 Spel 


pMOSbiue 


P 3 


Pl-S495Clal"S650-720EcoRrP2 


pBluescriptSK 


P4gp160 


Si63OXbarSi7O0EagrSi89OHin!ll 


pBluescriptSK 


P 5 


S2190Clal"S2330PstrS2425ES 


pBluescriptKS 


P 7 


P8gp16O-S2060SacirP5 


pBluescriptKS 


Psqpieo 


Si265XhorS 1 4 65 p str P 4aDl 60 


pBluescriptKS 



Piece 1: The building strategy is shown in figure 12. 

Preparation of the insert DNA: Five to 15jig of each plasmid 0-N-LANG-cl7 and 235-EcoRV- 
5 cl5N, respectively, were double-digested by Xbal/EcoRV, and Pstl/EcoRV, according to 
classical RE digestion procedure (Maniatis). The RE digestion products, were agarose gel 
purified according classical method (Maniatis). All RE digests were loaded on a 3% Nusieve 
3:1(FMC), TBE 0.5X agarose gel and submitted to electrophoresis (7 Volts/mm during 2- 
3hours) until optimal fragment separation. The agarose-band containing the DNA fragments 

10 that correspond to the snut's sequence sizes (243-bp for O-N-LANG and 143-bp for 235- 
EcoRV) were excised from the gel. The DNA was extracted from agarose by centrifugation 
20min at 5000g using a spin-X column (Costar cat#8160). Preparation of the vector: The 
snut 375-Pst1 klonl was used as plasmid vector. Five jug were digested with Xbal and Pst1. 
Removal of the polylinker Xbal/Pstl fragment was performed by classical agarose gel 

15 purification, using a 0.9% Seakem-GTG agarose, TBE 0.5X gel. The linearised plasmid DNA 
was extracted from the agarose by filtration through spin-X column. All purified DNA 
fragments were quantified by spectrophotometry. Ligation: All three DNA fragments O-N- 
LANG (Xbal/EcoRV), 235-EcoRV (Pstl/EcoRV) and 375-Pstl(Xbal/Pstl), were ligated 
together by classical ligation procedure, using an equimolar (vector:insert1:insert2) ratio of 

20 1:1:1. Thus for, 200 ng (0.1 pmole) of Xbal/Pstl-linearised 375-Pstl-cl1 were mixed with 16 
ng of O-N-LANG (Xbal/EcoRV) and 10 ng of 235-EcoRV (Pstl/EcoRV) in a final reaction 
volume of 20jal of 1X ligation buffer containing 10U of T4 DNA ligase (Biolabs, cat#202S). 
The ligation was allowed overnight at 16°C. Transformation: Competent XL1-Blue bacteria 
(Stratagene cat#200130, transformation efficiency > 5*10 6 col/^ig) were transformed by 

25 classical heat-chock procedure : 1/1 0th of the pre-chilled ligation reaction was mixed with 
50|ul of competent bacteria. The mixture was allowed to stand in ice during 30 min. Bacteria 
were heat-shocked at 42°C during 45 sec. and then left 2 min. on ice before being 
resuspended in 450 jxl of SOC medium. Transformed bacteria were incubated 1 hour at 37°C 
under shaking (250rpm) and plated on LB-ampicilin agar plates. The recombinant clones 
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were allowed to grow 16 hours at 37°C. Colony screening: 10 to 50 recombinant colonies 
were screened by direct PCR screening according to the protocole described into the 
pMOSBlue blunt-ended cloning kit booklet (RPN 5110, Amersham). Each colony was picked 
and resuspended in 50^1 of water. DNA was freed by a boilling procedure (100°C, 5 
5 min).Ten ^il of bacterial lysate were mixed to 1 jjJ of a 10mM solution of premixed 4 dNTP's , 
1 of M13reverse primer (Spmoles/^il, 5-CAGGAAACAGCTATGAC-3'), 1 v\ of T7 primer 
(5pmoles/^l, 5-TAATACGACTCACTATAGGG-3'), 5^il of 10x Expand HF buffer 2 
(Boehringer Mannheim, cat#1 759078), 0.5|il of Enzyme mix (Boehringer Mannheim, 5U/jxl) 
in a final volume of 50|ul. DNA amplification was performed with a thermo-cycler PE9600 

10 (Perkin-Elmer) using the following cycling parameters: 94°C, 2min, 35 cycles(94°C, 30sec; 
50°C f 15sec; 72°C,30sec); 72°C, 5min; 4°C hold. Five of the PCR products were analysed 
after electrophoresis on a 0.9% SeakemGTG , O.SxTBE agarose gel. Nucleotide sequence 
confirmation: ds-DNA was purified from minicultures of the selected clones with the JETstar 
mini plasmid purification system ( Genomed Inc.). Sequencing was performed using 

15 M13reverse and T7 primers and with the Big DyeTM Terminator Cycle Sequencing Ready 
reaction kit (Perkin-Elmer, Norwalk, Connecticut, P/N4303152) and the ABI-377 automated 
DNA sequenator (Applied Biosystems, Perkin-Elmer, Norwalk, Connecticut). Data were 
processed with the Sequence Navigator and Autoassembler softwares (Applied Biosystems, 
Perkin-Elmer,Norwalk, Connecticut). 

20 

Piece 2: The strategy for building that piece is depicted in figure 13. RE digestion, DNA 
fragments purification, ligation as well as direct PCR screening of recombinant colonies were 
performed according the same procedures described above for piecel, except the following: 

- The linearised plasmid 1 1 10-Spel-cl24M1 was used as vector after being digested by 
25 Hindlll and Spel, and agarose gel purified. 

- A 166-bp Hindlll/Sacl, obtained from snut 900-Xbal-cl15, as well as a 130-bp Sacl/Spel 
fragment, obtained from snut 990-Sacl-cl14, were agarose gel purified. 

- Equimolar amount (0.1 pmoles) of the three DNA fragments described above were iigated 
in an one step ligation. 

30 - 100nl of competent SCS1 10 bacteria (Stratagene cat# 200247) were transformed with 
1/10th of the ligation products according to the manufactor instruction. 

- Direct colony PCR screening was performed using T7 primer and pMOS-R (5- 
GTTGTAAAACGACGGCCAG-3'). 
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Piece 3: The strategy for building that piece is depicted in figure 14. RE digestion, DNA 
fragments purification, ligation as well as direct PCR screening of recombinant colonies were 
performed according the same procedures described above for piecel , except the following: 

- The plasmid piecel -c!33 was linearised by Clal and Xhol, in order to be used as vector, 
5 and agarose gel purified. 

- A 161 -bp Clal/EcoRI fragment, obtained from 495-Clal-cl135M1 as well as a 254-bp 
EcoRI/Xbal fragment, obtained from 650-720-EcoRI-c!39 } and a 374-bp Xbal/Xhol fragment, 
obtained from piece2-cl4, were agarose gel purified. 

- Equimolar amount (0.1 pmole) of each of these 4 DNA fragments were mixed and ligated 
10 together. 

- 50jjJ of competent XLIBlue bacteria were transformed with 1/1 0th of the ligation products 
according to the protocole described for piecel . 

- Direct colony PCR screening was performed using M13Reverse and T7 primers. 

15 Piece 4 gp160: The strategy for building that piece is depicted in figure 15. RE digestion, 
DNA fragments purification, ligation as well as direct PCR screening of recombinant colonies 
were performed according the same procedures described above for piecel, except the 
following: 

- The plasmid 1630-Xbal-cl2 was linearised by Sacll/ Eagl digestion and agarose gel 
20 purified, in order to be used as vector. 

- A 190-bp Eagl/Hindlll fragment, obtained from snut 1700-Eagl-cl4, as well as a 177-bp 
Sacll/Hindlll fragment, obtained from snut 1890-Hindlll-cl8, were agarose gel purified. 

- Equimolar amount (0.1 pmoles) of the three DNA fragments described above were ligated 
in an one step ligation. 

25 - 50jal of competent XL1 Blue bacteria were transformed with 1/1 0th of the ligation products 
according to the protocole described for piecel 

- Direct colony PCR screening was performed using M13Reverse and T7 primers. 

Piece 4-gp150: PCR-based site-directed mutagenesis was performed on double-stranded 
30 plasmid-DNA from piece4-cl4 according an adaptation of the ExSite™ PCR-Based Site- 
Directed Mutagenesis Kit procedure (Stratagene cat#200502)(Weiner, M.P., Costa, G.L., 
Schoettlin, W., Cline, J., Marthur, E., and Bauer, J.C. (1994) Gene 151:119-123). The 
mutations introduced are shown in bold letters in the primer sequences below. PCR 
amplification was performed with the Expand™ High Fidelity PCR System (Boehringer 
35 Mannheim, cat#1759078). Briefly, 1 .5 \xg, 0.5 \ig or 0.1 \xg of circular dsDNA was mixed with 
1.5 pmoles of P4M2S (5'-TCTGGAAGCTCAGGGGGCTGCATCCCTGGC-3') and 1.5 
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pmoles of P4M2AS (5'-CCCGCCTGCCCGTGTGACGGATCCAGCTCC-3') in a final volume 
of 50pJ containing 4 dNTPs (250jiM each), 1x Expand HF buffer 2 (Boehringer Mannheim, 
cat#1 759078), 0.75|ul of Enzyme mix (Boehringer Mannheim, 5U/|il). The PCR was 
performed with a PE9600 thermo-cycler (Perkin-Elmer Corporation) under the following 
5 cycling parameters : 94°C, 2min ; 15 cycles (94 °C, 45 sec ; 68 °C, 4 min); 72°C, 7min and 4 
°C, hold. PCR products were phenolxhloroform extracted and precipitated (Maniatis). 
Plasmid template was removed from PCR products by Dpnl treatment (Biolabs) (Nelson, M., 
and McClelland, M., 1992) followed by ethanol-precipitation. Amplicons were resuspended in 
50 (al steril water, and phosphorylated according the following procedure: 7.5 jlxI of amplicons 

10 were mixed with 0.5^1 of 100mM DTT, 1 |il of 10x pk buffer and 1 jil of pk mix enzyme 

(pMOSBlue blunt-ended cloning kit, Amersham , cat#RPN 5110). DNA kinasing was allowed 
5min at 22°C. After heat-inactivation (10min, 75°C) of the pk enzyme, 1jil of ligase (4units, 
Amersham , cat#RPN 51 10) was directly added to the pk reaction. The ligation was allowed 
overnight at 22°C. 50jml of competent XLIBlue bacteria were transformed with 1/10th of the 

15 ligation reaction according to the classical protocol (Maniatis). Insertion of mutations was 
checked by sequencing. 

Piece 4-gp140: PCR-based site-directed mutagenesis was performed on piece 4-cl4, 
according to the procedure described for piece4-gp150 except that the primers P4M1AS (5'- 
20 TGTGTGACTGATTGAGGATCCCCAACTGGC-3') and P4S (5'- 
AGCTTGCCCACTTGTCCAGCTGGAGCAGGT-3') were used. 

Snut 1265-Xhol-gp120 : PCR-based site-directed mutagenesis was performed on plasmid 
1265-Xhol-cl2M1 according to the procedure described for piece4-gp150 except that the 
25 primers 1265MAS (5 , -CTTCTCGCGCTGCACCACGCGGCGCTTGGC-3 , ) and 1265M2S (5'- 
CGCGCCTAGGGCATCGGCGCTATGTTCCTC-3') were used. 

Snut 1265-Xhol-gp160/uncleaved : PCR-based site-directed mutagenesis was performed 
on plasmid 1265-Xhol-cl2M1 according to the procedure described for piece4-gp150 except 
30 that the primers 1265MAS (5'-CTTCTCGCGCTGCACCACGCGGCGCTTGGC-3') and 
1265M2S (5 , -AGCGCCGTGGGCATCGGCGCTATGTTCCTC-3 , ) were used. 

Snut 1465-Pstl-CCG : PCR-based site-directed mutagenesis was performed on plasmid 
1465-Pstl-cl25 according to the procedure described for piece4-gp150 except that the 
35 primers 1465MAS (S'-CTGCTTGATGCCCCACACGGTCAGCTG-S') and 1465MS (5'- 
TGCTGCGGCCGCGTGCTGGCTCTAGA-3') were used. 
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Piece 5 : The strategy for building that piece is depicted in figure 16. RE digestion, DNA 
fragments purification, ligation as well as direct PCR screening of recombinant colonies were 
performed according the same procedures described above for piecel, except the following: 
5 - The plasmid 2425-ES-cl2 was linearised by Clal/EcoRI digestion and agarose gel purified, 
in order to be used as vector. 

- A 129-bp Pstl/Clal fragment, obtained 2190-Cial-cI6M15, as well as a 1 14-bp Pstl/Ecorl 
fragment, obtained from 2330-Pstl-cl8, were agarose gel purified. 

- Equimolar amount (0.1 pmoles) of the three DNA fragments described above were ligated 
10 in an one step ligation. 

- 50(0,1 of competent XLIBlue bacteria were transformed with 1/1 0th of the ligation products 
according to the protocole described for piecel . 

- Direct colony PCR screening was performed using T3 (5'-ATTAACCCTCACTAAAG-3') and 
T7 primers. 

15 

piece 8 : The strategy for building that piece is depicted in figure 17. RE digestion, DNA 
fragments purification, ligation as well as direct PCR screening of recombinant colonies were 
performed according the same procedures described above for piecel, except the following: 

- The plasmid piece4-cl4 was linearised by Xbal/Xhol and agarose gel purified, in order to be 
20 used as vector. 

- A 200-bp Xhol/Pstl fragment, obtained from 1265-Xhol-cl2M1 as well as a 178-bp Pstl/Xbal 
fragment, obtained from 1465-Pstl-cl25 were agarose gel purified. 

- Equimolar amount (0.1 pmole) of these 3 DNA fragments were mixed and ligated together. 

- SOjliI of competent XLIBlue bacteria were transformed with 1/1 0th of the ligation products 
25 according to the protocole described for piecel . 

- Direct colony PCR screening was performed using T3 and T7 primers. 

piece 8-gp150 : The strategy for building that piece was identical to that of piece 8, except 
that piece4-cl4M3 was used as vector (figure 18). 

30 

piece8-gp150/ uncleaved : The strategy for building that piece is identical to that of piece 8, 
except that piece4 gp160-cl4M3 is used as vector and a 200-bp Xhol/Pstl fragment, obtained 
from snut 1265-Xhot-gp160/uncleaved as well as a 178-bp Pstl/Xbal fragment, obtained from 
snut 1465-Pstl-CCG are used like inserts. 
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piece 8-gp140 : The strategy for building that piece was identical to that of piece 8, except 
that piece4-cl4M5 was used as vector (figure19). 

piece8-gp140/ uncleaved : The strategy for building that piece is identical to that of piece 8, 
5 except that piece4-cl4M5 is used as vector and a 200-bp Xhol/Pstl fragment, obtained from 
snut 1265-Xhol-gp160/uncleaved as well as a 178-bp Pstl/Xbal fragment, obtained from snut 
1465-Pstl-CCG are used like inserts. 

Piece8-gp41 : The strategy for building that piece is depicted in figure 20. RE digestion, 
10 DNA fragments purification, ligation as well as direct PCR screening of recombinant colonies 
were performed according the same procedures described for synBX08 gp160 gene. A 63 
bp linker is to be made according to the method described for snut 2425-ES, the minigene 
appraoch. Thus for 2 complementary oligonucleotides: 1265-gp41S(5 > - 
TCGAGgctagcGCCGTGGGCATCGGCGCTATGTTCCTCGGCTTCCTGGGCGctgca-3') and 
15 1265-gp41AS (5'-gCGCCCAGGAAGCCGAGGAAC- 

ATAGCGCCGATGCCCACGGCgctagcC-3' should be annealed together. This synthetic 
linker will be directly ligated into the Xho\ /Pst\ sites of piece8-klon13 from which the snut 
1265-Xhol-cl 2M1 would have been removed. 

20 piece7 : The strategy for building that piece is depicted in figure 21 . RE digestion, DNA 

fragments purification, ligation as well as direct PCR screening of recombinant colonies were 
performed according the same procedures described above for piecel, except the following: 

- The plasmid piece5-cl1 was linearised by Clal/Xhol and agarose gel purified, in order to be 
used as vector,. 

25 - A 798-bp Xhol/Sacll fragment, obtained from piece8-cl13 as well as a 140-bp Sacll/Clal 
fragment, obtained from 2060-Sacll-cl21 were agarose gel purified. 

- The ligation of the 3 fragments was performed using a vectoninsert ratio of 1:1, 1:2 or 1:5. 

- 50|il of competent XLIBlue bacteria were transformed with 1/1 0th of the ligation products 
according to the protocole described for piecel. 

30 - Direct colony PCR screening was performed using M13Reverse and T7 primers. 

Example 5: assembly of genes 

synBX08 gp160 gene : The strategy for building that gene is depicted in figure 6. RE 
digestion, DNA fragments purification, ligation as well as direct PCR screening of 
recombinant colonies were performed according the same procedures described for piecel. 
35 20 jig of the expression plasmid WRG7079 were digested by Nhel/BamHI. Plasmid DNA- 
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ends were dephosphorylated by Calf Intestin Phosphatase treatment (CIP, Biolabs) 
(Maniatis) to avoid autoiigation of any partially digested vector. CIP enzyme was heat- 
inactivated and removed by classical phenol-chloroforme extraction. A 1277-bp Nhel/Xhol 
fragment, obtained from piece3-cl27, as well as a 1194-bp Xhol/BamHI fragment, obtained 
5 from piece7-cl1, were agarose gel purified. The ligation was performed using a vectoninsert 
ratio of 1:1 or 1:2. Fifty yi\ of competent XLIBlue bacteria were transformed with 1/1 0th of the 
ligation product according to the protocole described for piecel. After transformation bacteria 
were plated on LB-kanamycin agar plates. Direct PCR colony screening was performed 
using the primer set WRG-F (5'-AGACATAATAGCTGACAGAC-3') and WRG-R (5'- 
10 GATTGTATTTCTGTCCCTCAC-3'). The nucleotide sequence was determined according the 
methods described above for piecel. 

synBX08 gp150 gene : The strategy for building that gene is depicted in figure 5. RE 
digestion, DNA fragments purification, ligation as well as direct PCR screening of 

15 recombinant colonies were performed according the same procedures described for 

synBX08 gp160 gene. A 1277-bp Nhel/Xhol fragment, obtained from piece3-cI27, as well as 
a 800-bp Xhol/BamHI fragment, obtained from piece 8-gp150-cl26, were agarose gel purified 
and then ligated into the Nhel/BamHI WRG7079 sites. The ligation was performed using a 
vectoninsert ratio of 1:1 or 1:2. 

20 For construction of the synthetic BX08 gp150, piece4 was mutated to Piece4gp150 whereby 
a tyrosine -> cysteine was changed and a stop codon was introduced after the 
transmembrane spanning domaine ( TMD), followed by a BamHI cloning site. A new 
piece8gp150 was constructed composed of snut1265/snut1465/piece4gp150. 

25 synBX08 gp150/uncleaved gene : RE digestion, DNA fragments purification, ligation as 
well as direct PCR screening of recombinant colonies are performed according the same 
procedures described for synBX08 gp160 gene. A 1277-bp Nhel/Xhol fragment, obtained 
from piece3-cl27, as well as a 800-bp Xhol/BamHI fragment, obtained from piece 8- 
gp150/uncleaved, are agarose gel purified and then are ligated into the Nhel/BamHI 

30 WRG7079 sites. The ligation was performed using a vector: insert ratio of 1 :1 or 1:2. 

synBX08 gp 140 gene : The strategy for building that gene is depicted in figure 4. RE 
digestion, DNA fragments purification, ligation as well as direct PCR screening of 
recombinant colonies were performed according the same procedures described for 
35 synBX08 gp160 gene. A 1277-bp Nhel/Xhol fragment, obtained from piece3-cl27, as well as 
a 647-bp Xhol/BamHI fragment, obtained from piece 8-gp140-cl2, were agarose gel purified 
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and then iigated into the Nhel/BamHI sites of WRG7079. The ligation was performed using a 
vector: insert ratio of 1:1 or 1:2. For construction of the synthetic BX08 gp140, piece4 was 
mutated to Piece4gp140 whereby a stop codon was introduced just before the TMD followed 
by a BamHI cloning site. A new piece8gp140 was constructed composed of 
5 snut1265/1465/piece4gp140. 

synBX08 gp140/uncleaved gene : RE digestion, DNA fragments purification, ligation as 
well as direct PCR screening of recombinant colonies are performed according the same 
procedures described for synBX08 gp160 gene. A 1277-bp Nhel/Xhol fragment, obtained 
10 from piece3-cl27, as well as a 800-bp Xhol/BamHI fragment, obtained from piece 8- 

gp140/uncleaved, are agarose gel purified and then Iigated into the Nhel/BamHI WRG7079 
sites. The ligation was performed using a vectoninsert ratio of 1:1 or 1:2. 

synBX08 gp 120 gene : The strategy for building that piece is depicted in figure 3. RE 
15 digestion, DNA fragments purification, ligation as well as direct PCR screening of 
recombinant colonies were performed according the same procedures described for 
synBX08 gp160 gene. A 1277-bp Nhel/Xhol fragment, obtained from piece3-cl27, as well as 
a 206-bp Xhol/BamHI fragment, obtained from 1265-Xhol-gp120-clM5, were agarose gel 
purified and then Iigated into the Nhel/BamHI sites of WRG7079. The ligation was performed 
20 using a vector: insert ratio of 1:1 or 1 :2. For construction of the synthetic BX08 gp120, snut 
1265 was mutated to S 12 65g P i2o to introduce a stop codon at the gp120/gp41 cleavage site 
followed by a BamHI cloning site. 

The gp160, gp150, gp140, and gp120 genes are cloned (Nhei-BamHI) and maintained in an 
25 eucaryotic expression vectors containing a CMV promotor and a tPA leader, but other 

expression vectors may be chosen based on other criteria e.g. antibiotic resistance selection, 
other leader sequences like CD5 etc, presence or not of immune stimulatory sequences etc. 

SynBX08 gp41 gene : The strategy for building that gene is depicted in figure 20. RE 
30 digestion, DNA fragments purification, ligation as well as direct PCR screening of 
recombinant colonies were performed according the same procedures described for 
synBX08 gp160 gene. Piece 8-gp41 is Iigated with snut 2060-Sacll-klon21 and piece 5 as 
already decribed for the construction of piece 7, creating piece 7-gp41 (P 7g p4i). Subsequently 
the piece 7-gp41 containing the entire gp41 gene will be cloned in WRG7079 using the Nhel 
35 and BamHI sites. 
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Example 6a: High expression by codon optimization 

To analyse the expression of glycoproteins from the wild type and synthetic BX08 envelope 
genes RIPA was performed on transfected mammalian cell lines. Both cell membrane 
associated and secreted HIV-1 glycoproteins from the cell supernatants were assayed. The 
5 envelope plasmids were transfected into the human embryonic kidney cell line 293 (ATCC, 
Rockville, MD) or the mouse P815 (H-2D d ) cell line using calcium phosphate (CellPhect 
Transfection kit, Pharmacia). For radio immune precipitation assay (RIPA), transfected cells 
were incubated overnight, washed twice and incubated for 1 hour with DMEM lacking 
cysteine and methionine (Gibco). Then the medium was replaced with medium containing 50 

10 pCi per ml of [ 36 S] cysteine and 50 pCi/ml of [ 35 S] methionine (Amersham Int., Amersham, 
UK) and incubation continued overnight. Cells were centrifuged, washed twice with HBSS 
and lysed in 1 ml ice-cold RIPA buffer (10 mM Tris, pH 7.4, 150 mM NaCI, 50 mM EDTA, 1% 
Nonidet P-40, 0.5% sodiumdeoxycholate) to detect membrane bound Env glycoproteins. The 
cell lysates were centrifuged for 15 min. at 100,000 x g to remove any undissolved particles 

15 and 100 ^il immune precipitated with protein A-sepharose coupled human polyclonal IgG 
anti-HIV antibodies (Nielsen et al., 1987). For analysis of secreted Env glycoproteins 500 jul 
of the 5 ml supernatants from transfected cells were incubated with protein A-sepharose 
coupled human polyclonal IgG anti-HIV antibodies. After washing three times in cold RIPA 
buffer and once in PBS, the immuno precipitates were boiled for 4 min. in 0.05 M Tris-HCI, 

20 pH 6.8, 2% SDS, 10% 2-mercaptoethanol, 10% sucrose, 0.01% bromophenol blue and 

subjected to SDS PAGE (Vinner et al., 1999). Electrophoresis was carried out at 80 mVfor 1 
hour in the stacking gel containing 10% acrylamide, and at 30 mV for 18 hours in the 
separating gradient gel containing 5-15% acrylamide. Gels were fixed in 30% ethanol-10% 
acetic acid for 1 hour, soaked for 30 min. in En3Hance (Dupont #NEF 981), washed 2x15 

25 min. in distilled water, dried and autoradiography performed on Kodak XAR-5 film. 
Transfection of human 293 cells with the syn.gp120 BX 08 and syn.gp140 B xo8 genes, 
respectively, resulted in high amounts of only secreted HIV-1 glycoproteins (Fig. 22a, lane 9 
and 8). Thus, the synthetic gene in the absence of rev expresses the HIV-1 surface 
glycoprotein of the expected size which is recognised by human anti-HIV-1 antisera. The 

30 expression of BX08 gp120 was Rev independent and with roughly the same high amount of 
gp120 from the syn.gp120MN gene (Fig. 22a, lane 2). Fig. 22a, lane 6 and lane 7 shows the 
expression of only membrane bound gp160 and gp150 from 293 cells transfected with 
syn.gp160 B xo8 and syn.gp150 B xo8 plasmids, respectively. Also transfection with wt.gp160 B xo8 
plasmid resulted in a significant expression of membrane bound gp160 despite the absence 

35 of Rev (Fig. 22a, lane 3). Co-transfection with equimolar amounts of Rev encoding plasmid 
seemed to increase this expression somewhat (Fig. 22a, lane 4). This is seen despite the 
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lower transfection effectivity using two plasmids and the use of only half the amount of 
wt.gp160 B xo8 DNA when combined with pRev. The amounts of secreted HIV-1 glycoproteins 
from gp120 and gp141 accumulating in the cell supernatants seemed higher than the 
amounts of cell associated glycoproteins at the time of harvesting of the cells. Interestingly, 
5 the amounts of gp160 produced from the "humanized" gene were about equal to the 

amounts produced by the wt.gp160 B xo8 + pRev genes, respectively (Fig. 22a, lane 4 and 6). 
The processing of gp160, gp150 and gp140 into gp120 plus a gp41, or fractions of gp41, 
produced from wild type or synthetic genes in the 293 cell-line did not function well under 
these experimental conditions. Same phenomenon was seen in RIPA from 293-CD4 cells 
10 and HeLa-CD4 cells infected by HIV-1 M n (Vinner et al., 19999). Because of the absence of 
CCR5 these cell-lines could, however, not be infected by HIV-1 strain BX08. 

Example 6b: Radio immuno precipitation assay (RIPA) of synthetic BX08 transfected 
cells showing expression of glycoproteins from synthetic BX08 env plasmids 

15 The synthetic envelope plasmid DNA were transfected into the human embryonic kidney cell 
line 293 (ATCC, Rockville, MD) using calcium phosphate (CellPhect Transfection kit, 
Pharmacia). For immune precipitation analysis, transfected 293 cells were treated and 
analysed according to the method described in example 6a. 
To analyse expression from these genes, an SDS-PAGE of the 35 S-labelled HIV-1 

20 envelopes, immune precipitated from the transfected cells is shown in figure 22b. Both cell- 
membrane associated and secreted HIV-1 envelope glycoproteins in the cell supernatants 
were assayed. Transfection of 293 cells with the synthetic BX08 gene encoding gp120 
(syn.gp120BX08) in lane 4, and syn.gp140BX08 (lane 5) that did not contain rev encoding 
regions, resulted in abundant amounts of HIV-1 gp120. Thus, the expressions were Rev 

25 independent and expressed in roughly same high amounts as the syn.gp120MN and 

syn.gp160MN genes (lane 3 and 2, respectively) already showed by our group and others to 
be markedly increased in comparisson with HIV MN wild type genes including rev (Vinner et 
al 1999). 

Transfection with syn.gp150 plasmid (lanes 7 and 8) resulted in significant expression of 
30 membrane associated gp120 and low detactable amounts of truncated form of gp41 (cell 
pellet in lane 7) with no detectable HIV-1 glycoprotein in the cell supernatant lane 8. It is 
concluded that the synthetic BX08 genes expres the envelope glycoproteins of expected size 
which are recognised by human anti-HIV-1 antiserum. 
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Exampel 6C FACS 

To quantitate the surface expression of HIV glycoproteins from the wild type and synthetic 
BX08 envelope genes transfection experiments were done and cell surface expression 
5 examined by FACS (flow cytometer). 

10 ^ig of the BX08 envelope plasmid (wild type BX08gp160 or synBX08gp160) plus 10 ^ig of 
an irrelevant carrier plasmid pBluescript were used to transfect a 80-90% confluent layer of 
293 cells in tissue culture wells (25 cm 2 ) using the CellPect kit (Pharmacia). After 48 hours 

10 cells were Versene treated, washed and incubated with a mouse monoclonal IgG antibodies 
to HIVgp120 (NEA-9301, NEN™, Life-Science Products Inc., Boston) for time 30 min. on wet 
ice followed by washing in PBS, 3% FCS and incubation with Phyto-Erytrin (PE) labelled rat 
anti-mouse lgG1 (Cat #346270, Becton Dickinson) according to the manufacture. After 
washing the cells were fixed in PBS, 1% paraformaldehyd, 3% FCS, and analysed on a 

15 FACS (FACScan, Becton-Dicknsson). Table 5 show in duplicate expression of BX08 gp160 
from 1 1 % of the cells transfected with wild type BX08 (number 1 and 2) compared to the 48 
% of cells expression BX08 glycoprotein when transfected with the synthetic gene (number 3 
and 4). Thus, a several fold higher expression is obtained using the synthetic BX08 gene. 

20 Table 5 FACS analysis of 293 cells transfected with synBX08gp160 (No 1 and 2) and 
wt.gp16+BX08 (No3 and 4) and stained with monoclonal antibodies to surface expressed 
HIV glycoproteins. A higher expression was obtained with the synthetic gene (mean 48%) as 
compared to the wild type gene (mean 1 1 %). 





50 ul 


45 ul 


A 


B 


C 


C -A 


C-B 


1 


syn.gp160BX08 + 


pBluescript SK+ 


2,57 


2,85 


36,91 


34,34 


34,06 


2 


syn.gp160BX08 + 


pBluescript SK+ 


2,83 


2,14 


58,42 


55,59 


56,28 


3 


wt.gp160BX08 + 


pBluescript SK+ 


1,95 


1,52 


7,51 


5,56 


5,99 


4 


wt.gp160BX08 + 


pBluescript SK+ 


2,97 


1,42 


14,41 


11,44 


12,99 



A: No primary antibody added (control for unspecific secondary Ab binding) 
25 B: Neither Primary Ab nor Secondary Ab added (autoflouroscense control) 
C: Primary Ab and secondary Ab added. 

Example 6D Analyses of the surface expression and biological functionality 

To analyse the surface expression and biological functionality from the wild type and 
30 synthetic BX08 envelope genes transfection experiments were done and cell fusion 
microscopically studied using HIV envelope receptor expressing cells. 
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10 jig of the BX08 envelope plasmid (wt.BX08gp160 or syn.BX08gp160 or empty WRG7079 
vector plasmid) plus 5 fig of a plasmid (pEGFP, Clonetech) expressing green fluorescent 
protein (GFP) were transfected into 2 x 10 6 adherent U87.CD4.CCR5 cells (NIH AIDS Res. & 
Reference program, catalog #4035) stabely expressing CD4 and CCR5, using the CellPhect 
5 transfection kit (Pharmacia). After 48 hours the cells were examined by microscopy and 
photographed (Fig 22c). 

Fig 22c panel A show the negative control (empty WRG7079 plus pGFP) giving no syncytia. 
Panel B show cells transfected with the wild type BX08 gp160 plasmid where cell-to-cell 
fusion (syncytia) is seen. Panel C show cells transfected with the same amounts of 
10 synBX08gp160 plasmid and demonstrating a much higher degree of cell-cell fusion. In fact 
most or all of the cells in the culture plate were fused at this time. This experiment show 
surface expression of functional HIV gp160 with tropism to the CCR5 receptor, as well as a 
much higher expression and biological activity from the synthetic BX08 gene as compared to 
the wild type equivalent. 

15 

Example 7: Gene inoculation of mice for immunization 

6-7 weeks old female BALB/c mice were purchased from Bomholdtgaard, Denmak. 
Microbiological status was conventional and the mice were maintained in groups of 4/5 per 
cage with food and water ad libitum and artificially lighted 12 hours per day. Acclimatization 

20 period was 2 days. Mice were anaesthetized with 0.2 ml i.p. of rohypnol:stesolid (1:3, v/v) 
and DNA inoculated by either i.m. injection of 50 jul 2 mg/ml of plasmid DNA in each tibia 
anterior muscle at week 0, 9, and 15 and terminated week 18; or gene gun inoculated on 
shaved abdominal skin using plasmid coated gold particles (0.95 particles, 2 jag DNA/mg 
gold, 0.5 mg gold/shot, 50-71% coating efficiency) with the hand held Helios® gene gun 

25 device (BioRad) employing compressed (400 psi) Helium as the particle motive force. Mice 
were gene gun vaccinated at week 0, 3, 6, 9, 15, and terminated week 18. 



Example 8: Serological assays 

Western blotting. The induction of a humoral response to gp120 and gp41 antigens by in 
30 vivo expression of the encoded glycoproteins from the synthetic BX08 genes was examined 
by western immuno blotting (Figure 27). Mouse antisera (1 :40) were evaluated in western 
blotting using the commercial HIV BLOT 2.2 strips (Genelabs Diagnostic). The conjugate 
was a 1:200 dilution of the alkaline phosphatase-conjugated rabbit anti-mouse IgG 
(Dakopatts, Glostrup, Denmark). Buffers, incubation condition and colour development were 
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used according to the manufacturer. In these western blotting strips the gp160 band from 
HIV-1 NIB contain of an oligomeric form of gp41 in a higher concentration than the 
monomeric gp41 band on the strip (Genelabs Diagnostic). HIV-1 mB lysate is used in these 
commercial strips where the gp160 band is composed by addition of tetrameric gp41.AII 
5 preimmune sera tested negative in western blotting. Mice inoculated with syn.gp120 B xo8 
showed antibody response to the heterologous gp120 of HIV-1 IIIB. Inclusion of the 
extracellular part of gp41 in the gene syn.gp140 B xo8 induced antibody reaction to both gp120 
and gp41 in all mice. This confirms the in vivo expression of BX08 gp120 and the 
extracellular part of gp41. DNA vaccination with syn.gp160 B xo8 encoding the membrane 
10 bound glycoprotein induced antibodies to gp120 and gp41 in 50% and 64% of the mice, 
respectively. DNA vaccination with syn.gp150 BX 08 induced detectable antibodies to gp120 
and gp41 in 41% and 53%, respectively. Induction of different levels of antibodies could 
explain the difference in numbers of positive reactive mice sera in this qualitative western 
blotting. 

15 ELISA. Mouse anti HIV-1 gp120 antibodies were measured by indirect ELISA. Briefly, wells 
of polystyrene plates Maxisorb (Nunc) were coated for 2 days at room temperature with HIV- 
1 IIIB recombinant gp120 (Intracel) at 0.2 jag/100 \i\ of carbonate buffer, pH 9.6. Before use 
the plates were blocked 1 hour at room temperature with 150 ^l/well of washing buffer (PBS, 
0.5 M NaCI, 1% Triton-X-100) plus 2% BSA and 2% skimmilk powder. After 3 x 1 min. 

20 washings, mouse plasma was added at 100 jul/well diluted in blocking buffer and ELISA 

plates incubated for 90 min. at room temperature using a microtiter plate shaker. As standard 
curve we used a mouse monoclonal antibody to a conserved part of gp120 between V5-C5 
(MRDNWRSELYKY) (#NEA-9301, NEN™ Life Science Products, Inc., Boston, MA). As 
calibration control included on each plate we used a plasma pool from 10 mice vaccinated 

25 with BX08 gp120. Plates were again washed 5x1 min. and incubated 1 hour at room 
temperature with 100 jJ/well of HRP-conjugated rabbit anti-mouse IgG (#P260, Dakopatts, 
Glostrup, Denmark) diluted 1:1000 in blocking buffer. Colour was developed with 100 jul/well 
of peroxidase enzyme substrate consisting of 4 mg of o-phenylenediamine in 1 1 ml water 
plus 4 jjJ hydrogen peroxide (30%, w/w). The enzyme reaction was terminated after 30 min. 

30 by 1 50 jLLl/well of 1 M H 2 S0 4 . The optical density (OD) of wells was measured at 492 nm using 
a microplate photometer (Molecular Devices, Biotech-Line, Denmark). Anti-HIV-gp120 IgG 
titers were expressed as the reciprocal plasma dilution resulting in an OD 49 2nm value of 0.500. 
Mouse anti-HIV-1 BX08 antibodies were also measured by indirect peptide ELISAs as 
described above using a BX08 V3 peptide (SIHIGPGRAFYTTGD) (Schafer, Copenhagen, 

35 Denmark). 
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The IgG antibody response to HIV-1 mB rgp120 quantitated by ELISA is seen in Fig. 28 
and Fig. 29. No background activity was observed in preimmune sera or in sera from 4 mice 
immunized with empty WRG7079 vector in parallel with the BX08 genes. All mice inoculated 
with the synthetic BX08 genes either by gene gun or by i.m. injection responded and showed 
5 a persistent and high titered (about 100-10,000) IgG response to rgp120 as exemplified in 
Fig. 4. When comparing the median titers for groups of mice (Fig. 29) a moderate antibody 
response was observed with the wt.gp160 B xoe. Intramuscular and gene gun immunization 
with a mixture of wtgp160 BX 08 plasmid plus Rev encoding plasmid did not increase this 
antibody response. This was found even when both plasmids were coated onto the same 

10 gold particles to ensure co-transfection of single target cells. However, to ensure inoculation 
of equal amounts of total DNA only half of the amount of wt.BX08 plasmid was used when 
mixing with pRev which may have contributed to the lower antibody response when pRev 
was included. A 5-fold improvement of the antibody response was obtained using the 
syn.gp160 B xo8 gene. This antibody response seemed further improved using the 

1 5 syn.gpl 50 BX o8 gene where the cytoplasmic internalization signals were eliminated but only 
using gene gun inoculation. For both the gene gun inoculation of skin and i.m injection the 
highest antibody titers to rgp120 were induced by genes encoding secreted gp120/gp140 
glycoproteins versus membrane bound gp150/gp160 glycoproteins, respectively. In general, 
equal antibody and ELISA titers to rgp120 were obtained using gene gun and i.m. injection of 

20 the BX08 vaccine genes. 

Example 9: Neutralization assay 

Mouse plasma was diluted in culture medium (RPMI-1640 medium (Gibco) supplemented 
with antibiotics (Gibco), Nystatin (Gibco) and 10% FCS (Bodinco)) and heat inactivated at 
60°C for 30 min. Of the HIV-1 strain BX08 (50 TCID 50 per ml propagated in PBMC) 250 |jl 

25 was incubated for 1 hour at room temperature with 250 pi dilution of mouse serum (four five- 
fold dilutions of mouse serum, final dilutions 1:20 to 1:2500). After incubation 1 x 10 6 PBMC 
in 500 pi culture medium was added to the virus-serum mixture and incubated overnight at 
37°C in 5% C0 2 . Subsequently, eight replicates of 10 5 PBMC in 200 pi culture medium were 
cultured in 96-well culture plates (Nunc) at 37°C in 5% C0 2 . After seven days in culture the 

30 concentration of HIV antigen in the culture supernatant was quantitated using HIV antigen 
detection ELISA (Nielsen et al., 1987). 

This ELISA is performed using human IgG, purified from high titered patient sera, both as 
capture antibody and biotin-linked as detecting antibody. In brief, anti-HIV-capture IgG diluted 
1:4000 in PBS, 100 jil/well, are coated onto Immunoplates (Nunc) overnight at4°C. After 
35 washing five times in washing buffer 100 ^l of supernatants are applied and incubated overnight 
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at 4°C. Plates are washed 5 times before incubation with 100 ]i\ HIV-IgG conjugated with biotin 
diluted 1:1000 in dilution buffer, plus 10% HIV-1 sero-negative human plasma for 3 hours at 
room temperature. Five times 1 min. washing in washing buffer are followed by 30 min. 
incubation with 100 ^l of 1:1000 avidine-peroxidase (Dako P347 diluted in dilution buffer). Six 
5 times 1 min. washings, 5 in washing buffer and the last one are done in dH 2 0 before colour is 
developed with 100 \x\ of peroxidase enzyme substrate consisting of 4 mg of OPD in 1 1 ml 
water plus 4 hydrogen peroxide (30 %, w/w). The enzyme reaction is terminated after 30 
minutes by additional 150 yd of 1M H 2 S0 4 . 

The HIV antigen concentration in cultures, preincubated with mouse serum, was expressed 
10 relatively to cultures without mouse serum (culture medium), and the percentage inhibitions 
of the different dilutions of mouse serum were calculated. The 50% inhibitory concentration 
(IC 50 ) for each mouse serum was determined by interpolation from the plots of percent 
inhibition versus the dilution of serum, and the neutralizing titer of the serum was expressed 
as the reciprocal value of the IC 50 . In each set-up a human serum pool known to neutralise 
15 other HIV-1 strains was included in the same dilutions as the mouse serum as a calibratin 
control. For assay of neutralization of the heterologous SHIV89.6P the MT-2-cell-killing 
format was used (Crawford et al., 1999). The assay stock of SHIV89.6P was grown in human 
PBMC. 

The neutralizing IC 50 antibody titers of plasma pools from 10 mice from each group 
20 were measured at different time points (week 0, 9, and 18). A positive background in some 
preimmune sera and thus in all week 0 serum pools was noted even after dilution and heat 
inactivation that was found earlier to lower this background. In general the neutralizing titers 
to BX08 virus of such serum pools were transient and low ranging from 1 :6-1 :150 above 
background (data not shown). A possible cross-neutralization reaction to a heterologous, 
25 primary HIV-1 envelope was tested using the SHIV89.6P which is relevant in macaque 
models of AIDS and serum pools from mice DNA immunized i.m. with syn.gp140 B xo8- 
Preimmune serum had a titer of 1:37, which is indicative of a slightly positive background, 
whereas the 18 week p.i. serum had a positive neutralizing titer of 1:254 above background. 

30 Example 10: CTL assay 

The cellular immune response in mice following gene gun or i.m. genetic immunization with 
the different vaccine plasmids were examined (Fig. 26). Spleen was removed aseptically and 
gently homogenised to single cell suspension, washed 3 times in RPMI-1640 supplemented 
with 10% FCS and resuspended to a final concentration of 5 x 10 7 cell/ml. The cells were 
35 then incubated 5 days with mitomycin-C treated (50 |ng/ml for 1 hour) mouse P815 (H-2D d ) 
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stimulator cells at a ratio of 10:1 in medium supplemented with 5 x 10' 5 M p- 
mercaptoethanol. For assay of CTL response to HIV-1 BX08, P815 stimulator cells and 
target cells were pulsed with 20 jig/ml of the HIV-1 BX08 V3 peptide containing a conserved 
murine H-2D d restricted CTL epitope (IGPGRAFYTT) (Lapham et al., 1996). After 
5 stimulation, splenocytes were washed three times with RPMI-1640 supplemented with 10% 
FCS and resuspended to a final concentration of 5 x 10 6 cells/ml. 100 jul of cell suspension 
was added in triplicate to U-bottom 96-well microtiter plates and a standard 4 hour 51 Cr- 
release assay performed (Marker et al., 1973). 

All synthetic BX08 plasmids induced a high specific CTL response thus confirming the in vivo 
10 expression and in vivo immunogenicity. The highest CTL response was obtained with 
syn.gp150 BX 08 followed by syn.gp120 B xo8-syn.gp140 B xo8, and syn.gp160 B xo8, respectively. 
Thus, the CTL response induced did not correlate with the antigen being secreted or not. 
However, i.m. DNA immunization with syn.gp150 B xo8 containing six putative CpG motifs 
induced a higher CTL response than gene gun immunization (Fig. 26). This difference could 
1 5 be explained by the high amount of DNA used in the i.m. injections. 

The T-lymphocyte cytokine profile of spleen cells after ConA stimulation as well as serum 
antibody IgGza/IgG! at week 18 were investigated. Neither the IFNy/IL-4 nor the lgG 2a /lgG 1 
ratios, which both reflects a Th1-type of immune response, were significantly higher for the 
i.m. immunized mice when compared with gene gun immunized mice (student t-test and 
20 Mann-Withney U-test). Thus, the CTL response did not correlate with a certain Th-type of 
response and the DNA immunization technique did not bias the immune response using 
synthetic BX08 genes. 



Example 11: Antibody responses to DNA vaccination with synBX08 env plasmid 

25 A relatively low and variable antibody response (1 of 10 mice) was obtained with gene gun 
inoculation of the syn.gp140BX08 plasmid vaccine starting at week 9, figure 23, right panel. 
A higher numbers of responders 3/10 with high lgG1 antibody responses at an earlier onset 
(week 3-9) was obtained with the syn.gp140BX08 plasmid using i.m. injection, left panel. 
Sera from later time points may show more responders and/or higher titers but are not 

30 assayed. However, these results show the induction of an antibody response to the BX08 V3 
peptide by DNA vaccination using one of the described synthetic BX08 constructs. 
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Claims 

1 . A method for producing a nucleotide sequence construct comprising the following steps: 

a) obtaining a first nucleotide sequence of an HIV gene from a patient within the first 12 
5 months of infection; 

b) designing a second nucleotide sequence utilising the most frequent codons from 
mammalian highly expressed proteins to encode the same amino acid sequence as the 
first nucleotide sequence of a) encodes 

c) redesigning the second nucleotide sequence of b) so that restriction enzyme sites 

10 surrounds the regions of the nucleotide sequence which encode functional regions of the 
amino acid sequence and so that selected restriction enzyme sites are removed thereby 
obtaining a third nucleotide sequence encoding the same amino acid sequence as the 
first and the second nucleotide sequence of a) and b) encode; 

d) redesigning the third nucleotide sequence of c) so that the terminal snuts contain 
15 convenient restriction enzyme sites for cloning into an expression vehicle; 

e) producing the snuts between restriction enzyme sites of c) and terminal snuts of d); 

f) assembling the snut of step e) to form a nucleotide sequence construct. 

2. A method according to claim 1 , wherein the HIV gene is the gene encoding the envelope. 

20 

3. A method according to claim 1 or 2, wherein the HIV gene encodes one or more Gag 
proteins. 

4. A method according to any of the preceding claims, wherein the HIV in step a) is in group 
25 M, O or N 

5. A method according to claim 4, wherein the HIV is a group M virus. 

6. A method according to any of the preceding claims, wherein the HIV is subtype A, B, C, D, 
30 E, F, G, H, I, or J. 

7. A method according to claim 6, wherein the HIV is subtype B. 

8. A method according to any of the preceding claims wherein the first nucleotide sequence 
35 is obtained by direct cloning. 
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9. A method according to any of the preceding claims, wherein the HIV in step a) is isolated 
with the first 11 months of infection, such a 10, 9, 8, 7, 6, 5, 4, 3, 2, 1, or 0.5 month after 
infection. 

5 

10. A method according to any of the preceding claims, wherein the redesigning in step c) is 
carried out after the second nucleotide sequence of step b) has been divided into pieces, 
so that each piece comprises only different restriction enzyme sites. 

10 1 1 . A method according to claim 10, wherein the second nucleotide sequence of step b) is 
divided into 9 pieces, or 8, or 7, or 6, or 5, or 4, or 3, or 2 pieces. 

12. A method according to claim 11, wherein the second nucleotide sequence of step b) is 
divided into 3 pieces. 

15 

13. A method according to any of the preceding claims, wherein the second nucleotide 
sequence of step b) is designed utilising the most frequent codons from human highly 
expressed proteins to encode the same amino acid sequence as the first nucleotide 
sequence of step a) encodes. 

20 

14. A nucleotide sequence construct obtainable by the method of any of claims 1-13. 

15. A nucleotide sequence construct according to claim 14, wherein the nucleotide sequence 
encoding the amino acid sequence in the first variable region is surrounded by EcoRV 

25 and Pst\ restriction enzyme sites. 

16. A nucleotide sequence construct according to claims 14 or 15, wherein the nucleotide 
sequence encoding the amino acid sequence in the second variable region is surrounded 
by Pst\ and C/al restriction enzyme sites. 

30 

17. A nucleotide sequence construct according to any of claims 14-16, wherein the 
nucleotide sequence encoding the amino acid sequence in the third variable region is 
surrounded by C/al and EcoRI restriction enzyme sites. 
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18. A nucleotide sequence construct according to any of claims 14-17, wherein the 

nucleotide sequence encoding the amino acid sequence in the transmembrane spanning 
region is surrounded by HindWl and Sad I restriction enzyme sites. 

5 19. A nucleotide sequence construct according to any of claims 14-18, wherein the 

nucleotide sequence encoding the amino acid sequence on both sites of the cleavage site 
is surrounded by Pstt and Xba\ restriction enzyme sites. 

20. A nucleotide sequence construct in isolated form which has a nucleotide sequence with 
0 the general formula (I), (II), (III), or (IV) or subsequences thereof 



(I) 


Pi 


"S495CJa|- 


-S 6 50-720EcoRr 


-P 2 -S 12 65gp120 




(II) 


Pr 


-S495Clal' 


"S650-720ECORI" 


•P 2 -Si265Xhor Si465Pstr 


P4gp140 


(III) 


Pr 


"S495Clar 


■S650-720ECORI" 


•P2-S 12 65Xhor Si465Pstr 


P4gp150 


(IV) 


Pr 


~S495Clar 


"S650-720EcoR|- 


•P2~S 126 5Xhor S^espstr 


P4gp160" S 20 60Sacir P5 



5 wherein P t designates the nucleotide sequence SEQ ID NO:41, a nucleotide sequence 

complementary thereto, or a nucleotide sequence with a sequence identity of at least 90% 
thereto; 

wherein S 495 ciai designates the nucleotide sequence SEQ ID NO: 7, a nucleotide sequence 
complementary thereto, or a nucleotide sequence with a sequence identity of at least 
95%thereto; 

wherein S 65 o-72oecori designates the nucleotide sequence SEQ ID NO: 9, a nucleotide 

sequence complementary thereto, or a nucleotide sequence with a sequence identity of at 
least 95% thereto; 

wherein P 2 designates the nucleotide sequence SEQ ID NO: 43, a nucleotide sequence 
complementary thereto, or a nucleotide sequence with a sequence identity of at least 85% 
thereto; 

wherein S 1265 gp 12 o designates the nucleotide sequence SEQ ID NO: 19, a nucleotide 

sequence complementary thereto, or a nucleotide sequence with a sequence identity of at 
least 70% thereto; 

wherein S 1265 xhoi designates the nucleotide sequence SEQ ID NO: 17, a nucleotide sequence 
complementary thereto, or a nucleotide sequence with a sequence identity of at least 80% 
thereto; 

wherein Si 46 5Psti designates the nucleotide sequence SEQ ID NO: 23, a nucleotide sequence 
complementary thereto, or a nucleotide sequence with a sequence identity of at least 90% 
thereto; 
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wherein P 4 g P i4o designates the nucleotide sequence SEQ ID NO: 57, a nucleotide sequence 
complementary thereto, or a nucleotide sequence with a sequence identity of at least 85% 
thereto; 

wherein P 4gp i5o designates the nucleotide sequence SEQ ID NO: 55, a nucleotide sequence 
5 complementary thereto, or a nucleotide sequence with a sequence identity of at least 85% 
thereto; 

wherein P 4gp i6o designates the nucleotide sequence SEQ ID NO: 53, a nucleotide sequence 
complementary thereto, or a nucleotide sequence with a sequence identity of at least 85% 
thereto; 

10 wherein S 2 o6osacn designates the nucleotide sequence SEQ ID NO: 33, a nucleotide 

sequence complementary thereto, or a nucleotide sequence with a sequence identity of at 
least 98% thereto; and 
wherein P 5 designates the nucleotide sequence SEQ ID NO: 59, a nucleotide sequence 
complementary thereto, or a nucleotide sequence with a sequence identity of at least 85% 

15 thereto. 

21. A nucleotide sequence construct according to claim 20, with the formula (I) 

( I ) Pi -S 49 5ClarS650-720EcoRr P2-S 1 265gp1 20 

20 22. A nucleotide sequence construct according to claim 20, with the formula (II) 

(II) Pl-S495ClarS650-720EcoRrP2"*S 12 65Xhor Si465Pstl" P4gp140 

23. A nucleotide sequence construct according to claim 20, with the formula (III) 

(III) Pl-S495ClarS650-720EcoR|-P2-Si265Xhor S 146 5Pstr P4gp150 

25 

24. A nucleotide sequence construct according to claim 20, with the formula (IV) 

(IV) Pl-S 4 95ClarS650-720EcoRrP2-Si265Xhor S 146 5Pstr P4gp16(T S 2 060Sacir P5 

25. A nucleotide sequence construct according to claim 20 consisting essentially of the 
30 subsequence P^ 

26. A nucleotide sequence construct according to claim 20 consisting essentially of the 
subsequence S 49 5ciai. 

35 27. A nucleotide sequence construct according to claim 20 consisting essentially of the 
subsequence S 6 5o-72oecori- 



WO 00/29561 



55 



PCT/DKOO/00144 



28. A nucleotide sequence construct according to claim 20 consisting essentially of the 
subsequence P 2 . 

29. A nucleotide sequence construct according to claim 20 consisting essentially of the 
subsequence S 12 65gpi2o. 

30. A nucleotide sequence construct according to claim 20 consisting essentially of the 
subsequence S^esxnoi- 

31 . A nucleotide sequence construct according to claim 20 consisting essentially of the 
subsequence S 1465 psti. 

32. A nucleotide sequence construct according to claim 20 consisting essentially of the 
subsequence P 4gp i4o- 

33. A nucleotide sequence construct according to claim 20 consisting essentially of the 
subsequence P 4gp i5 0 . 

34. A nucleotide sequence construct according to claim 20 consisting essentially of the 
subsequence P 4gp i6o. 

35. A nucleotide sequence construct according to claim 20 consisting essentially of the 
subsequence S 2 o6osacii 

36. A nucleotide sequence construct according to claim 20 consisting essentially of the 
subsequence P 5 . 

37. A nucleotide sequence construct with a sequence identity of more than 85% to the 
nucleotide sequence construct in any of claims 20-35. 

38. A nucleotide sequence construct according to claim 37, wherein the sequence identity is 
more than 90% such as more than 95%, 98%, or 99%. 

39. A nucleotide sequence construct according to claim 37, wherein the sequence identity is 
100%. 
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40. A nucleotide sequence construct according to any of claims 14-39, coding for an HIV 
envelope or parts thereof with an improved immunogenicity obtained by mutating the 
nucleotide sequence construct of any of claims 14-39 such that one or more glycosylation 

5 sites in the amino acid sequence have been removed. 

41. A nucleotide sequence construct according to claim 40 with a mutation at positions 
A307C + C309A and/or A325C + C327G and/or A340C + C342A and/or A385C + C387A 
and/or A469C + C471A or any combination of those. 

10 

42. A nucleotide sequence construct according to any of claims 14-41, coding for an HIV 
envelope or parts thereof with a binding site for the CXCR4 co-receptor in the third 
variable region. 

15 43. A nucleotide sequence construct according to claim 42 with a mutation at positions 
G865C + A866G. 

44. A nucleotide sequence construct according to any of claims 14-43, coding for an HIV 
envelope or parts thereof, wherein an immunodominant epitope has been modified. 

20 

45. A nucleotide sequence construct according to claim 44, wherein an immunodominant 
epitope in the third variable region has been modified. 

46. A nucleotide sequence construct according to claim 45 with a deletion of nucleotides 
25 793-897. 

47. A nucleotide sequence construct according to claim 44, wherein an immunodominant 
epitope has been removed from gp41 . 

30 48. A nucleotide sequence construct according to any of claims 14-47, coding for an HIV 
envelope or parts thereof, wherein the cleavage site between gp41 and gp120 is 
removed. 

49. A nucleotide sequence construct according to claim 48 with a mutation at position 
35 C1423A. 
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50. A nucleotide sequence construct according to claim 20 consisting essentially of the 
subsequence P 1t S 4 95ci a i> S 6 5o-72oecori, arid P 2 . 

51 . A nucleotide sequence construct according to claim 20 consisting essentially of the 
5 subsequence S 12 65Xhoi, S 146 5Pstn and P4 gp i4o- 

52. A nucleotide sequence construct according to claim 20 consisting essentially of the 
subsequence Si265Xhoi» Si465Psti, P4gpi6o, S 2 o6osacib and P5. 

10 53. A nucleotide sequence construct according to any of claims 14-52, further comprising a 
nucleotide sequence repeat coding for a functional region of the amino acid sequence. 

54. A nucleotide sequence construct according to claim 53, wherein the nucleotide sequence 
repeat codes for amino acids in the third variable region. 

15 

55. A nucleotide sequence construct according to any of claim 14-54, further comprising a 
nucleotide sequence coding for a T-helper cell epitope containing sequence. 

56. An expression vehicle selected from a group of viral vectors consisting of simliki forest 
20 virus (sfv), adenovirus and Modified Vaccinia Virus Ankara (MVA), further comprising a 

nucleotide sequence construct according to any of claim 14-55. 

57. A method of individualised immunotherapy wherein the virus from a newly diagnosed 
patient is directly cloned, the envelope is produced with highly expressed codons, inserted 

25 into any of the nucleotide sequence constructs of claims 14-55, and administered to the 
patient. 

58. Use of a nucleotide sequence construct according to any of claims 14-55 in medicine. 

30 59. Use of a nucleotide sequence according to any of claims 14-55 for the manufacture of a 
vaccine for the prophylactics of infection with HIV in humans. 

60. Use of a nucleotide sequence according to any of claims 14-55 for the manufacture of a 
composition for the treatment of an HIV infection in a human within 24 weeks of primary 
35 infection. 
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61 . Use of the nucleotide sequence according to any of claims 14-55 for the production of a 
recombinant protein. 
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SEQUENCE LISTING 
<110> Statens Serum Institut 

<120> Method for producing an nucleotide 

sequence construct with optimised codons for an HIV envelope 
based on a primary, clinical HIV isolate and the BX08 
construct . 

<130> 21924DK1 

<160> 76 



<170> FastSEQ for Windows Version 3.0 

<210> 1 
<211> 243 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (243) 

<400> 1 

get age gcg gec gac cgc ctg tgg gtg acc gtg tac tac ggc gtg ccc 

Ala Ser Ala Ala Asp Arg Leu Trp Val Thr Val Tyr Tyr Gly Val Pro 

1 5 10 " " 15 

gtg tgg aag gac gec acc acc acc ctg ttc tgc gec age gac gec aag 
Val Trp Lys Asp Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys 
20 25 30 

gec tac gac acc gag gtg cac aac gtg tgg gec acc cac gcg tgc gtg 
Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val 
35 4 0 4 5 

ccc acc gac ccc aac ccc cag gag gtg gtg ctg ggc aac gtg acc gag 
Pro Thr Asp Pro Asn Pro Gin Glu Val Val Leu Gly Asn Val Thr Glu 
50 55 60 

aac ttc aac atg ggc aag aac aac atg gtg gag cag atg cac gag gat 
Asn Phe Asn Met Gly Lys Asn Asn Met Val Glu Gin Met His Glu Asp 
65 70 75 80 

ate 
lie 



48 



96 



144 



192 



240 



243 



<210> 2 
<211> 81 
<212> PRT 

<213> Artificial Sequence 
<400> 2 

Ala Ser Ala Ala Asp Arg Leu Trp Val Thr Val Tyr Tyr Gly Val Pro 
1 5 10 " 15 
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Val 
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40 








45 
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Thr 


Asp 


Pro 


Asn 


Pro 


Gin 


Glu 


Val 


Val 


Leu 


Gly Asn 


Val 


Thr 


Glu 




50 










55 










60 








Asn 


Phe 


Asn 


Met 


Gly 


Lys 


Asn 


Asn 


Met 


Val 


Glu 


Gin Met 


His 


Glu 


Asp 


65 










70 










75 








80 


He 































<210> 3 
<211> 143 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (143) 

<400> 3 

gat ate ate age ctg tgg gac cag age ctg aag ccc tgc gtg aag ctg 48 

Asp He He Ser Leu Trp Asp Gin Ser Leu Lys Pro Cys Val Lys Leu 

15 10 15 

acc ccc ctg tgc gtg acc ctg aac tgc acc aag ctg aag aac age acc 96 
Thr Pro Leu Cys Val Thr Leu Asn Cys Thr Lys Leu Lys Asn Ser Thr 
20 25 30 

gac acc aac aac acc cgc tgg ggc acc cag gag atg aag aac tgc ag 143 
Asp Thr Asn Asn Thr Arg Trp Gly Thr Gin Glu Met Lys Asn Cys 
35 40 45 



<210> 4 
<211> 47 
<212> PRT 

<213> Artificial Sequence 
<400> 4 

Asp He He Ser Leu Trp Asp Gin Ser Leu Lys Pro Cys Val Lys Leu 

15 10 15 

Thr Pro Leu Cys Val Thr Leu Asn Cys Thr Lys Leu Lys Asn Ser Thr 

20 25 30 

Asp Thr Asn Asn Thr Arg Trp Gly Thr Gin Glu Met Lys Asn Cys 
35 40 45 

<210> 5 
<211> 132 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (132) 

<400> 5 

ctg cag ctt caa cat cag cac cag cgt gcg caa caa gat gaa gcg cga 4 8 

Leu Gin Leu Gin His Gin His Gin Arg Ala Gin Gin Asp Glu Ala Arg 
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15 10 15 

gta cgc cct gtt eta cag cct gga cat cgt gec cat cga caa cga caa 96 
Val Arg Pro Val Leu Gin Pro Gly His Arg Ala His Arg Gin Arg Gin 
20 25 30 

cac cag eta ccg cct gcg cag ctg caa cac ate gat 132 
His Gin Leu Pro Pro Ala Gin Leu Gin His lie Asp 
35 40 



<210> 6 
<211> 44 
<212> PRT 

<213> Artificial Sequence 
<400> 6 

Leu Gin Leu Gin His Gin His Gin Arg Ala Gin Gin Asp Glu Ala Arg 

1 5 10 15 

Val Arg Pro Val Leu Gin Pro Gly His Arg Ala His Arg Gin Arg Gin 

20 25 30 

His Gin Leu Pro Pro Ala Gin Leu Gin His lie Asp 
35 4 0 

<210> 7 
<211> 161 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (161) 

<400> 7 

ate gat cat cac cca ggc ctg ccc caa ggt gag ctt cga gec cat ccc 48 

lie Asp His His Pro Gly Leu Pro Gin Gly Glu Leu Arg Ala His Pro 

15 10 15 

cat cca ctt ctg cgc ccc cgc egg ctt cgc cat cct gaa gtg caa caa 96 
His Pro Leu Leu Arg Pro Arg Arg Leu Arg His Pro Glu Val Gin Gin 
20 25 30 



caa gac ctt caa egg cac egg ccc ctg cac caa cgt gag cac cgt gca 
Gin Asp Leu Gin Arg His Arg Pro Leu His Gin Arg Glu His Arg Ala 
35 4 0 4 5 



144 



gtg cac cca egg aat tc 161 
Val His Pro Arg Asn 
50 



<210> 8 
<211> 53 
<212> PRT 

<213> Artificial Sequence 
<400> 8 

lie Asp His His Pro Gly Leu Pro Gin Gly Glu Leu Arg Ala His Pro 
15 10 15 
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His Pro Leu Leu Arg Pro Arg Arg Leu Arg His Pro Glu Val Gin Gin 

20 25 30 

Gin Asp Leu Gin Arg His Arg Pro Leu His Gin Arg Glu His Arg Ala 

35 40 45 

Val His Pro Arg Asn 
50 



<210> 9 
<211> 254 
<212> DNA 

<213> Artificial Sequence 



<220> 

<221> CDS 

<222> (1) . . . (254) 



<400> 9 

gaa ttc gcc ccg tgg tga gca ccc age tgc tgc tga acg gca gec tgg 48 

Glu Phe Ala Pro Trp * Ala Pro Ser Cys Cys * Thr Ala Ala Trp 
15 10 



ccg agg agg agg tgg tga tea gat ctg aga act tea cca aca acg cca 96 
Pro Arg Arg Arg Trp * Ser Asp Leu Arg Thr Ser Pro Thr Thr Pro 
15 20 25 



aga cca tea teg tgc age tga acg aga gcg tgg aga tea act gca ccc 14 4 

Arg Pro Ser Ser Cys Ser * Thr Arg Ala Trp Arg Ser Thr Ala Pro 
30 35 40 



gcc cca aca aca aca ccc gca aga gca tec aca teg gcc ctg gcc gcg 192 
Ala Pro Thr Thr Thr Pro Ala Arg Ala Ser Thr Ser Ala Leu Ala Ala 
45 50 55 60 



cct tct aca cca ccg gcg aca tea teg gcg aca tec gcc agg ccc act 240 
Pro Ser Thr Pro Pro Ala Thr Ser Ser Ala Thr Ser Ala Arg Pro Thr 
65 70 75 



gca aca tct eta ga 
Ala Thr Ser Leu 
80 



254 



<210> 10 
<211> 80 
<212> PRT 

<213> Artificial Sequence 



<400> 10 
Glu Phe Ala Pro Trp Ala Pro Ser 

1 5 
Arg Arg Trp Ser Asp Leu Arg Thr 
20 

Ser Cys Ser Thr Arg Ala Trp Arg 

35 40 
Thr Pro Ala Arg Ala Ser Thr Ser 

50 55 
Pro Ala Thr Ser Ser Ala Thr Ser 
65 70 



Cys Cys Thr Ala Ala Trp Pro Arg 

10 15 
Ser Pro Thr Thr Pro Arg Pro Ser 
25 30 
Ser Thr Ala Pro Ala Pro Thr Thr 
45 

Ala Leu Ala Ala Pro Ser Thr Pro 
60 

Ala Arg Pro Thr Ala Thr Ser Leu 
75 80 
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<210> 11 
<211> 92 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (92) 



<400> 11 

tct aga acc aac tgg acc aac acc ctg aag cgc gtg gcc gag aag ctg 4 8 

Ser Arg Thr Asn Trp Thr Asn Thr Leu Lys Arg Val Ala Glu Lys Leu 
5 1-0 15 

cgc gag aag ttc aac aac acc acc ate gtg ttc aac cag age tc 92 

Arg Glu Lys Phe Asn Asn Thr Thr lie Val Phe Asn Gin Ser 
20 25 30 



<210> 12 
<211> 30 
<212> PRT 

<213> Artificial Sequence 
<400> 12 

Ser Arg Thr Asn Trp Thr Asn Thr Leu Lys Arg Val Ala Glu Lys Leu 

15 10 15 

Arg Glu Lys Phe Asn Asn Thr Thr lie Val Phe Asn Gin Ser 

20 25 30 

<210> 13 
<211> 130 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (130) 

<400> 13 

gag etc egg egg cga ccc cga gat cgt gat gca cag ctt caa ctg egg 4 8 

Glu Leu Arg Arg Arg Pro Arg Asp Arg As.p Ala Gin Leu Gin Leu Arg 
1 5 10 15 

egg cga gtt ctt eta ctg caa cac cac cca get gtt caa cag cac ctg 96 
Arg Arg Val Leu Leu Leu Gin His His Pro Ala Val Gin Gin His Leu 
20 25 30 

gaa cga gac caa cag cga ggg caa cat cac tag t 130 
Glu Arg Asp Gin Gin Arg Gly Gin His His * 
35 40 



<210> 14 

<211> 42 

<212> PRT 

<213> Artificial 



Sequence 



<400> 14 
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Glu Leu Arg Arg Arg Pro Arg Asp Arg Asp Ala Gin Leu Gin Leu Arg 

1 5 10 15 

Arg Arg Val Leu Leu Leu Gin His His Pro Ala Val Gin Gin His Leu 

20 25 30 

Glu Arg Asp Gin Gin Arg Gly Gin His His 
35 40 

<210> 15 
<211> 164 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) ... (164) 

<400> 15 

act agt ggc acc ate acc ctg ccc tgc cgc ate aag cag ate ate aac 48 

Thr Ser Gly Thr lie Thr Leu Pro Cys Arg He Lys Gin He He Asn 
15 10 15 

atg tgg cag gag gtg ggc aag gec atg tac gec ccc ccc ate ggc ggc 96 
Met Trp Gin Glu Val Gly Lys Ala Met Tyr Ala Pro Pro lie Gly Gly 
20 25 30 

cag ate aag tgc ctg age aac ate acc ggc ctg ctg ctg acc cgc gac 144 
Gin He Lys Cys Leu Ser Asn He Thr Gly Leu Leu Leu Thr Arg Asp 
35 40 45 

ggc ggc age gac aac teg ag 164 
Gly Gly Ser Asp Asn Ser 
50 



<210> 16 
<211> 54 
<212> PRT 

<213> Artificial Sequence 





<400> 


16 


























Thr 


Ser 


Gly 


Thr 


He 


Thr 


Leu 


Pro 


Cys 


Arg 


He 


Lys 


Gin 


He 


He 


Asn 


1 








5 










10 










15 




Met 


Trp 


Gin 


Glu 
20 


Val 


Gly 


Lys 


Ala 


Met 
25 


Tyr 


Ala 


Pro 


Pro 


He 
30 


Gly 


Gly 


Gin 


He 


Lys 
35 


Cys 


Leu 


Ser 


Asn 


He 
40 


Thr 


Gly 


Leu 


Leu 


Leu 
45 


Thr 


Arg 


Asp 


Gly 


Gly 
50 


Ser 


Asp 


Asn 


Ser 























<210> 17 
<211> 200 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (200) 



<400> 17 



WO 00/29561 



PCT/DKOO/00144 



etc gag cag egg caa gga gat ttt ccg ccc egg egg egg cga cat gcg 4 8 

Leu Glu Gin Arg Gin Gly Asp Phe Pro Pro Arg Arg Arg Arg His Ala 
15 10 15 

cga caa ctg gcg cag cga get gta caa gta caa ggt ggt gaa gat cga 96 
Arg Gin Leu Ala Gin Arg Ala Val Gin Val Gin Gly Gly Glu Asp Arg 
20 25 30 

gec cct ggg cat cgc ccc cac caa ggc caa gcg ccg cgt ggt gca gcg 14 4 

Ala Pro Gly His Arg Pro His Gin Gly Gin Ala Pro Arg Gly Ala Ala 
35 40 45 

crg-a gaa gcg cgc cgt ggg cat egg cgc— tat gtt cct egg ctt cct ggg 192 
Arg Glu Ala Arg Arg Gly His Arg Arg Tyr Val Pro Arg Leu Pro Gly 
50 55 60 

cgc tgc ag 200 
Arg Cys 
65 



<210> 18 
<211> 66 
<212> PRT 

<213> Artificial Sequence 
<400> 18 

Leu Glu Gin Arg Gin Gly Asp Phe Pro Pro Arg Arg Arg Arg His Ala 

1 5 10 " 15 

Arg Gin Leu Ala Gin Arg Ala Val Gin Val Gin Gly Gly Glu Asp Arg 

20 25 30 

Ala Pro Gly His Arg Pro His Gin Gly Gin Ala Pro Arg Gly Ala Ala 

35 40 45 

Arg Glu Ala Arg Arg Gly His Arg Arg Tyr Val Pro Arg Leu Pro Gly 

50 55 60 

Arg Cys 
65 

<210> 19 
<211> 212 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (212) 

<400> 19 

etc gag cag egg caa gga gat ttt ccg ccc egg egg egg cga cat gcg 48 

Leu Glu Gin Arg Gin Gly Asp Phe Pro Pro Arg Arg Arg Arg His Ala 
15 10 15 

cga caa ctg gcg cag cga get gta caa gta caa ggt ggt gaa gat cga 96 
Arg Gin Leu Ala Gin Arg Ala Val Gin Val Gin Gly Gly Glu Asp Arg 
20 25 30 

gec cct ggg cat cgc ccc cac caa ggc caa gcg ccg cgt ggt gca gcg 14 4 

Ala Pro Gly His Arg Pro His Gin Gly Gin Ala Pro Arg Gly Ala Ala 
35 40 45 



WO 00/29561 



8 



PCT/DK00/00144 



cga gaa gcg cgc eta ggg cat egg cgc tat gtt cct egg ctt cct ggg 192 

Arg Glu Ala Arg Leu Gly His Arg Arg Tyr Val Pro Arg Leu Pro Gly 

50 55 60 

cgc tgc age ccg ggg gat cc 212 

Arg Cys Ser Pro Gly Asp 

65 7 0 





<210> 


20 




























<211> 


70 




























<212> 


PRT 




























<213> 


Artificial Sequence 




















<400> 


20 


























Leu 


Glu 


Gin 


Arg 


Gin 


Gly 


Asp 


Phe 


Pro 


Pro 


Arg 


Arg 


Arg 


Arg 


His 


Ala 


1 








5 










10 










15 




Arg 


Gin 


Leu 


Ala 


Gin 


Arg 


Ala 


Val 


Gin 


Val 


Gin 


Gly 


Gly 


Glu 


Asp 


Arg 








20 










25 










30 






Ala 


Pro 


Gly 


His 


Arg 


Pro 


His 


Gin 


Gly 


Gin 


Ala 


Pro 


Arg 


Gly Ala 


Ala 






35 










40 










45 








Arg 


Glu 


Ala 


Arg 


Leu 


Gly 


His 


Arg 


Arg 


Tyr 


Val 


Pro 


Arg 


Leu 


Pro 


Gly 




50 










55 










60 










Arg 


Cys 


Ser 


Pro 


Gly 


Asp 






















65 










70 
























<210> 


21 




























<211> 


200 




























<212> 


DNA 




























<213> 


Artificial Sequence 




















<220> 






























<221> 


CDS 




























<222> 


(1) • 


. . . (200) 
























<400> 


21 


























etc 


gag 


cag 


egg 


caa 


gga 


gat 


ttt 


ccg 


ccc 


egg 


egg 


egg 


cga 


cat 


gcg 


Leu 


Glu 


Gin 


Arg 


Gin 


Gly 


Asp 


Phe 


Pro 


Pro 


Arg 


Arg 


Arg 


Arg 


His 


Ala 


1 








5 










10 










15 




cga 


caa 


ctg 


gcg 


cag 


cga 


get 


gta 


caa 


gta 


caa 


ggt 


ggt 


gaa 


gat 


cga 


Arg 


Gin 


Leu 


Ala 


Gin 


Arg 


Ala 


Val 


Gin 


Val 


Gin 


Gly 


Gly 


Glu 


Asp 


Arg 








20 










25 










30 






gec 


cct 


ggg 


cat 


cgc 


ccc 


cac 


caa 


ggc 


caa 


gcg 


ccg 


cgt 


ggt 


gca 


gcg 


Ala 


Pro 


Gly 


His 


Arg 


Pro 


His 


Gin 


Gly 


Gin 


Ala 


Pro 


Arg 


Gly 


Ala 


Ala 






35 










40 










45 








cga 


gaa 


gag 


cgc 


cgt 


ggg 


cat 


egg 


cgc 


tat 


gtt 


cct 


egg 


ctt 


cct 


ggg 


Arg 


Glu 


Glu 


Arg 


Arg 


Gly 


His 


Arg 


Arg 


Tyr 


Val 


Pro 


Arg 


Leu 


Pro 


Gly 




50 










55 










60 











48 



96 



144 



192 



cgc tgc ag 200 
Arg Cys 
65 



<210> 22 



WO 00/29561 



PCT/DKOO/00144 



9 



<211> 66 
<212> PRT 

<213> Artificial Sequence 



<400> 22 
Leu Glu Gin Arg 
1 

Arg Gin Leu Ala 
20 

Ala Pro Gly His 
35 

Arcr Glu Glu Arg 

50 
Arg Cys 
65 



Gin Gly Asp Phe 
5 

Gin Arg Ala Val 

Arg Pro His Gin 
40 

Arg Gly His Arg 
55 



Pro Pro Arg Arg 
10 

Gin Val Gin Gly 
25 

Gly Gin Ala Pro 

Arg Tyr Val Pro 
60 



Arg Arg His Ala 
15 

Gly Glu Asp Arg 
30 

Arg Gly Ala Ala 
45 

Arg Leu Pro Gly 



<210> 23 
<211> 178 
<212> DNA 

<213> Artificial Sequence 



<220> 

<221> CDS 

<222> (1) . . . (178) 

<400> 23 

ctg cag gca gca cca tgg gcg ccg cca gcc tga ccc tga ccg tgc agg 4 8 

Leu Gin Ala Ala Pro Trp Ala Pro Pro Ala * Pro * Pro Cys Arg 
15 10 



ccc gcc age tgc tga gcg gca teg tgc age age aga aca acc tgc tgc 96 
Pro Ala Ser Cys * Ala Ala Ser Cys Ser Ser Arg Thr Thr Cys Cys 
15 20 25 



gcg cca teg agg ccc age age acc tgc tec age tga ccg tgt ggg gca 144 
Ala Pro Ser Arg Pro Ser Ser Thr Cys Ser Ser * Pro Cys Gly Ala 
30 35 40 



tea age age tec agg ccc gcg tgc tgg etc tag a 178 
Ser Ser Ser Ser Arg Pro Ala Cys Trp Leu * 
45 50 



<210> 24 
<211> 54 
<212> PRT 

<213> Artificial Sequence 



<400> 24 

Leu Gin Ala Ala Pro Trp Ala Pro 

1 5 
Ser Cys Ala Ala Ser Cys Ser Ser 
20 

Arg Pro Ser Ser Thr Cys Ser Ser 

35 40 
Arg Pro Ala Cys Trp Leu 
50 



Pro Ala Pro Pro Cys Arg Pro Ala 

10 15 
Arg Thr Thr Cys Cys Ala Pro Ser 
25 30 
Pro Cys Gly Ala Ser Ser Ser Ser 
45 



<210> 25 



WO 00/29561 



10 



PCT/DKOO/00144 



<211> 178 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (178) 

<400> 25 

ctg cag gca gca cca tgg gcg ccg cca gcc tga ccc tga ccg tgc agg 48 
Leu Gin Ala Ala Pro Trp Ala Pro Pro Ala * Pro * Pro Cvs Ara 
1 5 10 

ccc gcc age tgc tga gcg gca teg tgc age age aga aca acc tgc tgc 96 
Pro Ala Ser Cys * Ala Ala Ser Cys Ser Ser Arg Thr Thr Cys Cys 
15 20 25 



gcg cca teg agg ccc age age acc tgc tec age tga ccg tgt ggg gca 144 
Ala Pro Ser Arg Pro Ser Ser Thr Cys Ser Ser * Pro Cys Gly Ala 
30 35 40 



tea age agt get gcg gcc gcg tgc tgg etc tag a 178 
Ser Ser Ser Ala Ala Ala Ala Cys Trp Leu * 
4 5 50 



<210> 26 
<211> 54 
<212> PRT 

<213> Artificial Sequence 



<400> 26 



Leu 


Gin 


Ala 


Ala 


Pro 


Trp 


Ala 


Pro 


1 








5 








Ser 


Cys 


Ala 


Ala 
20 


Ser 


Cys 


Ser 


Ser 


Arg 


Pro 


Ser 
35 


Ser 


Thr 


Cys 


Ser 


Ser 
40 


Ala 


Ala 


Ala 


Cys 


Trp 


Leu 







50 



Pro Ala Pro Pro Cys Arg Pro Ala 

10 15 

Arg Thr Thr Cys Cys Ala Pro Ser 
25 30 

Pro Cys Gly Ala Ser Ser Ser Ala 
45 



<210> 27 
<211> 77 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (77) 



<400> 27 

tct aga gcg eta cct cca gga cca gcg 
Ser Arg Ala Leu Pro Pro Gly Pro Ala 
1 5 



ctt cct ggg cat gtg ggg ctg 4 8 

Leu Pro Gly His Val Gly Leu 
10 15 



etc egg caa get gat ctg cac cac ggc eg 77 
Leu Arg Gin Ala Asp Leu His His Gly 

20 25 



WO 00/29561 



11 
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<210> 28 
<211> 25 
<212> PRT 

<213> Artificial Sequence 



<400> 28 

Ser Arg Ala Leu Pro Pro Gly Pro Ala Leu Pro Gly His Val Gly Leu 

1 5 10 15 

Leu Arg Gin Ala Asp Leu His His Gly 
20 2 5 



<210> 29 
<211> 190 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<221> CDS 

<222> (1) . . . (190) 



<400> 29 

egg ccg tgc cct gga acg cca get gga gca aca aga acc tga gec aga 

Arg Pro Cys Pro Gly Thr Pro Ala Gly Ala Thr Arg Thr * Ala Arg 

15 10 15 



ttt ggg aca aca tga cct gga tgg agt ggg age gcg aga tea gca act 
Phe Gly Thr Thr * Pro Gly Trp Ser Gly Ser Ala Arg Ser Ala Thr 

20 25 30 



aca ccg aga tea tct aca gec tga teg agg aga gee aga acc age agg 
Thr Pro Arg Ser Ser Thr Ala * Ser Arg Arg Ala Arg Thr Ser Arg 
35 40 45 



aga aga acg age tgg acc tgc tec age tgg aca agt ggg caa get t 190 
Arg Arg Thr Ser Trp Thr Cys Ser Ser Trp Thr Ser Gly Gin Ala 
50 55 60 



<210> 30 
<211> 60 
<212> PRT 

<213> Artificial Sequence 



<400> 30 
Arg Pro Cys Pro 
1 

Gly Thr Thr Pro 
20 

Arg Ser Ser Thr 
35 

Ser Trp Thr Cys 
50 



Gly Thr Pro Ala 
5 

Gly Trp Ser Gly 

Ala Ser Arg Arg 
40 

Ser Ser Trp Thr 
55 



Gly Ala Thr Arg 
10 

Ser Ala Arg Ser 
25 

Ala Arg Thr Ser 

Ser Gly Gin Ala 
60 



Thr Ala Arg Phe 
15 

Ala Thr Thr Pro 
30 

Arg Arg Arg Thr 
45 



<210> 31 
<211> 177 
<212> DNA 

<213> Artificial Sequence 



WO 00/29561 



12 
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<220> 

<221> CDS 

<222> (1)...<177) 

<400> 31 

aag ctt gtg gaa ctg gtt caa cat cac caa ctg get gtg gta cat caa 4 8 

Lys Leu Val Glu Leu Val Gin His His Gin Leu Ala Val Val His Gin 
15 10 15 

gat ttt cat cat gat cgt ggg egg cct gat egg cct gcg cat cgt gtt 96 
Asp Phe His His Asp Arg Gly Arg Pro Asp Arg Pro Ala His Arg Val 
20 25 30 

cac cgt get gag cat cgt gaa ccg cgt gcg cca ggg eta cag ccc cct 14 4 

His Arg Ala Glu His Arg Glu Pro Arg Ala Pro Gly Leu Gin Pro Pro 
35 40 45 

gag ctt cca gac ccg cct gee cgt gee ccg egg 177 
Glu Leu Pro Asp Pro Pro Ala Arg Ala Pro Arg 
50 55 





<210> 


32 
























<211> 


59 
























<212> 


PRT 
























<213> 


Artificial Sequence 




















<400> 


32 






















Lys 


Leu Val 


Glu 


Leu 


Val Gin His 


His 


Gin 


Leu 


Ala 


Val 


Val 


His 


Gin 


1 






5 






10 










15 




Asp 


Phe His 


His 


Asp 


Arg Gly Arg 


Pro 


Asp 


Arg 


Pro 


Ala 


His 


Arg 


Val 






20 






25 










30 






His 


Arg Ala 


Glu 


His 


Arg Glu Pro 


Arg 


Ala 


Pro 


Gly 


Leu 


Gin 


Pro 


Pro 




35 






40 










45 








Glu 


Leu Pro 


Asp 


Pro 


Pro Ala Arg 


Ala 


Pro 


Arg 














50 






55 



















<210> 33 
<211> 140 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (140) 

<400> 33 

ccg egg ccc cga ccg ccc cga ggg cat cga gga gga ggg egg cga gcg 4 8 

Pro Arg Pro Arg Pro Pro Arg Gly His Arg Gly Gly Gly Arg Arg Ala 
15 10 15 

cga ccg cga ccg cag cac ccg cct ggt gac egg ctt cct gee cct gat 96 
Arg Pro Arg Pro Gin His Pro Pro Gly Asp Arg Leu Pro Ala Pro Asp 
20 25 30 



ctg gga cga cct gcg cag cct gtt cct gtt cag eta cca teg at 140 
Leu Gly Arg Pro Ala Gin Pro Val Pro Val Gin Leu Pro Ser 
35 40 45 



WO 00/29561 
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<210> 34 
<211> 46 
<212> PRT 

<213> Artificial Sequence 
<400> 34 

Pro Arg Pro Arg Pro Pro Arg Gly His Arg Gly Giy Gly Arg Arg Ala 

15 10 15 

Arg Pro Arg Pro Gin His Pro Pro Gly Asp Arg Leu Pro Ala Pro Asp 

20 25 30 

Leu Gly Arg Pro Ala Gin Pro Val Pro Val Gin Leu Pro Ser 
35 40 45 

<210> 35 
<211> 129 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (129) 

<400> 35 

ate gat tgc gcg acc tgc tgc tga teg tgg ccc gca teg tgg age tgc 4 8 

lie Asp Cys Ala Thr Cys Cys * Ser Trp Pro Ala Ser Trp Ser Cys 
15 10 15 

tgg gec ggc gcg get ggg aga tec tga agt act ggt gga acc tgc tec 96 
Trp Ala Gly Ala Ala Gly Arg Ser * Ser Thr Gly Gly Thr Cys Ser 
20 25 30 

agt act gga gec agg age tga aga act ctg cag 129 
Ser Thr Gly Ala Arg Ser * Arg Thr Leu Gin 
35 40 





<210> 


36 




















<211> 


40 




















<212> 


PRT 




















<213> 


Artificial Sequence 




















<400> 


36 


















He 


Asp Cys 


Ala Thr Cys Cys Ser 


Trp 


Pro 


Ala 


Ser 


Trp 


Ser 


Cys 


Trp 


1 




5 




10 










15 




Ala 


Gly Ala 


Ala Gly Arg Ser Ser 


Thr 


Gly 


Gly 


Thr 


Cys 


Ser 


Ser 


Thr 






20 


25 










30 






Gly 


Ala Arg 


Ser Arg Thr Leu Gin 



















35 40 



<210> 37 
<211> 114 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (114) 



WO 00/29561 
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<400> 37 

ctg cag tga gcc tgc tga acg cca ccg cca teg ccg tgg ccg agg gca 4 8 

Leu Gin * Ala Cys * Thr Pro Pro Pro Ser Pro Trp Pro Arg Ala 

1 5 10 

ccg acc gcg tga teg agg tgg tgc age gca tct ggc gcg gca tec tgc 96 
Pro Thr Ala * Ser Arg Trp Cys Ser Ala Ser Gly Ala Ala Ser Cys 
15 20 25 

aca tec cca ccc gaa ttc 114 
Thr Ser Pro Pro Glu Phe 
30 35 





<210> 


38 


















<211> 


35 


















<212> 


PRT 


















<213> 


Artificial Sequence 


















<400> 


38 
















Leu 


Gin Ala 


Cys Thr Pro Pro Pro 


Ser Pro 


Trp 


Pro 


Arg 


Ala 


Pro 


Thr 


1 




5 


10 










15 




Ala 


Ser Arg 


Trp Cys Ser Ala Ser 


Gly Ala 


Ala 


Ser 


Cys 


Thr 


Ser 


Pro 






20 


25 








30 






Pro 


Glu Phe 



















35 



<210> 39 
<211> 41 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1)...(41) 

<400> 39 

gaa ttc gcc agg get teg age gcg ccc tgc tgt aag gat cc 41 
Glu Phe Ala Arg Ala Ser Ser Ala Pro Cys Cys Lys Asp 
1 5 10 



<210> 40 
<211> 13 
<212> PRT 

<213> Artificial Sequence 
<400> 40 

Glu Phe Ala Arg Ala Ser Ser Ala Pro Cys Cys Lys Asp 
1 5 10 

<210> 41 
<211> 506 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (506) 



WO 00/29561 



PCT7DK00/00144 



15 



<400> 41 

get age gcg gec gac cgc ctg tgg gtg acc gtg tac tac ggc gtg ccc 4 8 

Ala Ser Ala Ala Asp Arq Leu Trp Val Thr Val Tyr Tyr Gly Val Pro 

15 10 15 

gtg tgg aag gac gec acc acc acc ctg ttc tgc gec age gac gec aag 96 

Val Trp Lys Asp Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys 

20 25 30 

gec tac gac acc gag gtg cac aac gtg tgg gec acc cac gcg tgc gtg 144 

Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val 

35 40 45 

ccc acc gac ccc aac ccc cag gag gtg gtg ctg ggc aac gtg acc gag 192 

Pro Thr Asp Pro Asn Pro Gin Glu Val Val Leu Gly Asn Val Thr Glu 

50 55 60 

aac ttc aac atg ggc aag aac aac atg gtg gag cag atg cac gag gat 240 

Asn Phe Asn Met Gly Lys Asn Asn Met Val Glu Gin Met His Glu Asp 

65 70 75 80 

ate ate age ctg tgg gac cag age ctg aag ccc tgc gtg aag ctg acc 288 

lie lie Ser Leu Trp Asp Gin Ser Leu Lys Pro Cys Val Lys Leu Thr 

85 90 95 

ccc ctg tgc gtg acc ctg aac tgc acc aag ctg aag aac age acc gac 336 

Pro Leu Cys Val Thr Leu Asn Cys Thr Lys Leu Lys Asn Ser Thr Asp 

100 105 110 

acc aac aac acc cgc tgg ggc acc cag gag atg aag aac tgc age ttc 384 

Thr Asn Asn Thr Arg Trp Gly Thr Gin Glu Met Lys Asn Cys Ser Phe 

115 120 125 

aac ate age acc age gtg cgc aac aag atg aag cgc gag tac gec ctg 432 

Asn lie Ser Thr Ser Val Arg Asn Lys Met Lys Arg Glu Tyr Ala Leu 

130 135 140 

ttc tac age ctg gac ate gtg ccc ate gac aac gac aac acc age tac 480 

Phe Tyr Ser Leu Asp lie Val Pro lie Asp Asn Asp Asn Thr Ser Tyr 

145 150 155 160 

cgc ctg cgc age tgc aac aca teg at 506 

Arg Leu Arg Ser Cys Asn Thr Ser 
165 



<210> 42 
<211> 168 
<212> PRT 

<213> Artificial Sequence 
<400> 42 

Ala Ser Ala Ala Asp Arg Leu Trp Val Thr Val Tyr Tyr Gly Val Pro 

1 5 10 15 

Val Trp Lys Asp Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys 

20 25 30 

Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val 
35 40 45 
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16 



PCT/DK00/00144 



Pro 


Thr 
bu 


Asp 


Pro 


Asn 


Pro 


Gin 
55 


Glu 


Val 


Val 


Leu 


Gly 
60 


Asn 


Val 


Thr 


Glu 


Asn 


Phe 


Asn 


Met 


Gly 


Lys 


Asn 


Asn 


Met 


Val 


Glu 


Gin 


Met 


His 


Glu 


Asp 


DO 










70 










75 










80 


lie 


lie 


Ser 


Leu 


Trp 
85 


Asp 


Gin 


Ser 


Leu 


Lys 
90 


Pro 


Cys 


Val 


Lys 


Leu 
95 


Thr 


Pro 


Leu 


Cys 


Val 


Thr 


Leu 


Asn 


Cys 


Thr 


Lys 


Leu 


Lys 


Asn 


Ser 


Thr 


Asp 








100 










105 










110 




Thr 


Asn 


Asn 


Thr 


Arg 


Trp 


Gly 


Thr 


Gin 


Glu 


Met 


Lys 


Asn 


Cys 


Ser 


Phe 






115 










120 










125 






Asn 


lie 


Ser 


Thr 


Ser 


Val 


Arg 


Asn 


Lys 


Met 


Lys 


Arg 


Glu 


Tyr 


Ala 


Leu 




130 










135 










140 










Tyr 


Ser 


Leu 


Asp 


He 


Val 


Pro 


ILe-^Asp Asn Asp Asn 


Thr 


Ser 


Tyr 


145 










150 










155 










160 


Arg 


Leu 


Arg 


Ser 


Cys 


Asn 


Thr 


Ser 



















165 



<210> 43 
<211> 374 
<212> DNA 

<213> Artificial Sequence 



<220> 

<221> CDS 

<222> (1) . . . (374) 



<400> 43 

tct aga acc aac tgg acc aac acc ctg aag cgc gtg gcc gag aag ctg 48 
Ser Arg Thr Asn Trp Thr Asn Thr Leu Lys Arg Val Ala Glu Lys Leu 
1 5 10 15 



cgc gag aag ttc aac aac acc acc ate gtg ttc aac cag age tec ggc 96 
Arg Glu Lys Phe Asn Asn Thr Thr He Val Phe Asn Gin Ser Ser Gly 
20 25 30 



ggc gac ccc gag ate gtg atg cac age ttc aac tgc ggc ggc gag ttc 144 

Gly Asp Pro Glu He Val Met His Ser Phe Asn Cys Gly Gly Glu Phe 

35 4 0 4 5 

ttc tac tgc aac acc acc cag ctg ttc aac age acc tgg aac gag acc 192 

Phe Tyr Cys Asn Thr Thr Gin Leu Phe Asn Ser Thr Trp Asn Glu Thr 

50 55 60 



aac age gag ggc aac ate act agt ggc acc ate acc ctg ccc tgc cgc 240 
Asn Ser Glu Gly Asn He Thr Ser Gly Thr He Thr Leu Pro Cys Arg 
65 70 75 80 

ate aag cag ate ate aac atg tgg cag gag gtg ggc aag gcc atg tac 288 
He Lys Gin He He Asn Met Trp Gin Glu Val Gly Lys Ala Met Tyr 
85 90 ~ 95 



gcc ccc ccc ate ggc ggc cag ate aag tgc ctg age aac ate acc ggc 336 
Ala Pro Pro lie Gly Gly Gin He Lys Cys Leu Ser Asn He Thr Gly 
100 105 HO 



ctg ctg ctg acc cgc gac ggc ggc age gac aac teg ag 
Leu Leu Leu Thr Arg Asp Gly Gly Ser Asp Asn Ser 
115 120 



374 
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<210> 44 
<211> 124 
<212> PRT 

<213> Artificial Sequence 
<400> 44 



Ser 


Arg 


Thr 


Asn 


Trp 


Thr 


Asn 


Thr 


Leu 


Lys 


Arg 


val 


Ai a 


IjXU 


Lys 


Leu 


1 








5 










10 










15 




Arg 


Glu 


Lys 


Phe 


Asn 


Asn 


Thr 


Thr 


He 


Val 


Phe 


Asn 


Gin 


Ser 


Ser 


Gly 








20 










25 










30 






Glv 


Asp 


Pro 


Glu 


He 


Val 


Met 


His 


Ser 


Phe 


Asn 


Cys 


Gly 


Gly 


Glu 


Phe 






35 










40 










45 








Phe 


Tyr 


Cys 


Asn 


Thr 


Thr 


Gin 


Leu 


Phe 


Asn 


Ser 


Thr 


Trp 


Asn 


Glu 


Thr 




50 










55 










60 










Asn 


Ser 


Glu 


Gly 


Asn 


He 


Thr 


Ser 


Gly 


Thr 


He 


Thr 


Leu 


Pro 


Cys 


Arg 


65 










70 










75 










80 


lie 


Lys 


Gin 


He 


He 


Asn 


Met 


Trp 


Gin 


Glu 


Val 


Gly 


Lys 


Ala 


Met 


Tyr 










85 










90 










95 




Ala 


Pro 


Pro 


He 


Gly 


Gly 


Gin 


He 


Lys 


Cys 


Leu 


Ser 


Asn 


He 


Thr 


Gly 








100 










105 










110 






Leu 


Leu 


Leu 


Thr 


Arg 


Asp 


Gly 


Gly 


Ser 


Asp 


Asn 


Ser 











115 120 



<210> 45 
<211> 1277 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (1277) 

<400> 45 
get age gcg gec gac cgc 
Ala Ser Ala Ala Asp Arg 
1 5 

gtg tgg aag gac gec acc 

Val Trp Lys Asp Ala Thr 
20 

gec tac gac acc gag gtg 

Ala Tyr Asp Thr Glu Val 
35 

ccc acc gac ccc aac ccc 

Pro Thr Asp Pro Asn Pro 
50 

aac ttc aac atg ggc aag 

Asn Phe Asn Met Gly Lys 

65 70 

ate ate age ctg tgg gac 
He He Ser Leu Trp Asp 

85 



ctg tgg gtg 
Leu Trp Val 



acc acc ctg 
Thr Thr Leu 
25 

cac aac gtg 
His Asn Val 
40 

cag gag gtg 
Gin Glu Val 
55 

aac aac atg 
Asn Asn Met 



cag age ctg 
Gin Ser Leu 



acc gtg tac 
Thr Val Tyr 
10 

ttc tgc gec 
Phe Cys Ala 



tgg gec acc 
Trp Ala Thr 



gtg ctg ggc 
Val Leu Gly 
60 

gtg gag cag 
Val Glu Gin 
75 

aag ccc tgc 
Lys Pro Cys 
90 



tac ggc gtg 
Tyr Gly Val 
15 

age gac gec 
Ser Asp Ala 
30 

cac gcg tgc 
His Ala Cys 
45 

aac gtg acc 
Asn Val Thr 



atg cac gag 
Met His Glu 



gtg aag ctg 
Val Lys Leu 
95 



ccc 4 8 

Pro 



aag 96 
Lys 



gtg 144 
Val 



gag 192 
Glu 



gat 240 
Asp 
80 

acc 288 
Thr 



ccc ctg tgc gtg acc ctg aac tgc acc aag ctg aag aac age acc gac 



336 
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Pro Leu Cys Val Thr Leu Asn Cys Thr Lys Leu Lys Asn Ser Thr Asp 
100 105 " 110 

acc aac aac acc cgc tgg ggc acc cag gag atg aag aac tgc age ttc 384 
Thr Asn Asn Thr Arg Trp Gly Thr Gin Glu Met Lys Asn Cys Ser Phe 
115 120 125 

aac ate age acc age gtg cgc aac aag atg aag cgc gag tac gee ctg 432 
Asn lie Ser Thr Ser Val Arg Asn Lys Met Lys Arg Glu Tyr Ala Leu 
130 135 140 

ttc tac age ctg gac ate gtg ccc ate gac aac gac aac acc age tac 480 
•Phe Tyr Ser Leu Asp lie Val Pro Il-e- Asp Asn Asp Asn Thr Ser Tyr 
145 150 155 160 

cgc ctg cgc age tgc aac aca teg ate ate acc cag gee tgc ccc aag 528 
Arg Leu Arg Ser Cys Asn Thr Ser lie lie Thr Gin Ala Cys Pro Lys 
165 170 175 

gtg age ttc gag ccc ate ccc ate cac ttc tgc gee ccc gee ggc ttc 576 
Val Ser Phe Glu Pro lie Pro lie His Phe Cys Ala Pro Ala Gly Phe 
180 185 190 

gec ate ctg aag tgc aac aac aag acc ttc aac ggc acc ggc ccc tgc 624 
Ala lie Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys 
195 200 205 

acc aac gtg age acc gtg cag tgc acc cac gga att cgc ccc gtg gtg 672 
Thr Asn Val Ser Thr Val Gin Cys Thr His Gly lie Arg Pro Val Val 
210 215 220 

age ace cag ctg ctg ctg aac ggc age ctg gec gag gag gag gtg gtg 720 
Ser Thr Gin Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val 
225 230 235 240 

ate aga tct gag aac ttc acc aac aac gec aag acc ate ate gtg cag 768 
lie Arg Ser Glu Asn Phe Thr Asn Asn Ala Lys Thr lie lie Val Gin 
245 250 255 

ctg aac gag age gtg gag ate aac tgc acc cgc ccc aac aac aac acc 816 
Leu Asn Glu Ser Val Glu lie Asn Cys Thr Arg Pro Asn Asn Asn Thr 
260 265 • 270 

cgc aag age ate cac ate ggc cct ggc cgc gec ttc tac acc acc ggc 864 
Arg Lys Ser lie His lie Gly Pro Gly Arg Ala Phe Tyr Thr Thr Gly 
275 280 285 

gac ate ate ggc gac ate cgc cag gee cac tgc aac ate tct aga acc 912 
Asp lie lie Gly Asp lie Arg Gin Ala His Cys Asn lie Ser Arg Thr 
290 295 300 

aac tgg acc aac acc ctg aag cgc gtg gee gag aag ctg cgc gag aag 960 
Asn Trp Thr Asn Thr Leu Lys Arg Val Ala Glu Lys Leu Arg Glu Lys 
305 310 315 320 

ttc aac aac acc acc ate gtg ttc aac cag age tec ggc ggc gac ccc 1008 
Phe Asn Asn Thr Thr lie Val Phe Asn Gin Ser Ser Gly Gly Asp Pro 
325 330 335 
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gag ate gtg atg cac 

Glu He Val Met His 
340 

aac acc acc cag ctg 

Asn Thr Thr Gin Leu 
355 

ggc aac ate act agt 

Gly Asn He Thr Ser 
370 

strc ate aac atg tgg 

He He Asn Met Trp 
385 

ate ggc ggc cag ate 

He Gly Gly Gin He 
405 

acc cgc gac ggc ggc 

Thr Arg Asp Gly Gly 
420 



<210> 46 
<211> 425 
<212> PRT 
<213> Artificial Sequence 



<400> 46 



Ala 


Ser 


Ala 


Ala 


Asp 


Arg 


Leu 


Trp 


Val 


Thr 


Val 


Tyr 


Tyr 


Gly 


Val 


Pro 


1 








5 










10 










15 




Val 


Trp 


Lys 


Asp 


Ala 


Thr 


Thr 


Thr 


Leu 


Phe 


Cys 


Ala 


Ser 


Asp 


Ala 


Lys 








20 










25 










30 






Ala 


Tyr 


Asp 


Thr 


Glu 


Val 


His 


Asn 


Val 


Trp 


Ala 


Thr 


His 


Ala 


Cys 


Val 






35 










40 










45 








Pro 


Thr 


Asp 


Pro 


Asn 


Pro 


Gin 


Glu 


Val 


Val 


Leu 


Gly 


Asn 


Val 


Thr 


Glu 




50 










55 










60 










Asn 


Phe 


Asn 


Met 


Gly 


Lys 


Asn 


Asn 


Met 


Val 


Glu 


Gin 


Met 


His 


Glu 


Asp 


65 










70 










75 










80 


He 


He 


Ser 


Leu 


Trp 


Asp 


Gin 


Ser 


Leu 


Lys 


Pro 


Cys 


Val 


Lys 


Leu 


Thr 










85 










90 










95 




Pro 


Leu 


Cys 


Val 


Thr 


Leu 


Asn 


Cys 


Thr 


Lys 


Leu 


Lys 


Asn 


Ser 


Thr 


Asp 








100 










105 










110 






Thr 


Asn 


Asn 


Thr 


Arg 


Trp 


Gly 


Thr 


Gin 


Glu 


Met 


Lys 


Asn 


Cys 


Ser 


Phe 






115 










120 










125 








Asn 


He 


Ser 


Thr 


Ser 


Val 


Arg 


Asn 


Lys 


Met 


Lys 


Arg 


Glu 


Tyr 


Ala 


Leu 




130 










135 










140 










Phe 


Tyr 


Ser 


Leu 


Asp 


He 


Val 


Pro 


He 


Asp 


Asn 


Asp 


Asn 


Thr 


Ser 


Tyr 


145 










150 










155 










160 


Arg 


Leu 


Arg 


Ser 


Cys 


Asn 


Thr 


Ser 


He 


He 


Thr 


Gin 


Ala 


Cys 


Pro 


Lys 










165 










170 










175 




Val 


Ser 


Phe 


Glu 


Pro 


He 


Pro 


He 


His 


Phe 


Cys 


Ala 


Pro 


Ala 


Gly 


Phe 








180 










185 










190 






Ala 


He 


Leu 


Lys 


Cys 


Asn 


Asn 


Lys 


Thr 


Phe 


Asn 


Gly 


Thr 


Gly 


Pro 


Cys 






195 










200 










205 








Thr 


Asn 


Val 


Ser 


Thr 


Val 


Gin 


Cys 


Thr 


His 


Gly 


He 


Arg 


Pro 


Val 


Val 




210 










215 










220 











age ttc aac tgc ggc ggc gag ttc ttc tac tgc 1056 
Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys 
345 350 

ttc aac age acc tgg aac gag acc aac age gag 1104 
Phe Asn Ser Thr Trp Asn Glu Thr Asn Ser Glu 
360 365 

ggc acc ate acc ctg ccc tgc cgc ate aag cag 1152 
Gly Thr He Thr Leu Pro Cys Arg He Lys Gin 
375 380 

cag gag gtg ggcr^aag gee atg tac gec ccc ccc 1200 
Gin Glu Val Gly Lys Ala Met Tyr Ala Pro Pro 
390 395 400 

aag tgc ctg age aac ate acc ggc ctg ctg ctg 1248 
Lys Cys Leu Ser Asn He Thr Gly Leu Leu Leu 
410 415 

age gac aac teg ag 1277 
Ser Asp Asn Ser 
4 25 
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Ser 


Thr 


Gin 


Leu 


Leu 


Leu 


Asn 


Gly 


Ser 


Leu 


Ala 


Glu 


Glu 


Glu 


Val 


Val 


225 










230 










235 










240 


He 


Arg 


Ser 


Glu 


Asn 


Phe 


Thr 


Asn 


Asn 


Ala 


Lys 


Thr 


He 


He 


Val 


Gin 










245 










250 










255 




Leu 


Asn 


Glu 


Ser 


Val 


Glu 


He 


Asn 


Cys 


Thr 


Arg 


Pro 


Asn 


Asn 


Asn 


Thr 








260 










265 










270 






Arg 


Lys 


Ser 


He 


His 


He 


Gly 


Pro 


Gly Arg 


Ala 


Phe 


Tyr 


Thr 


Thr 


Gly 






275 










280 










285 








Asp 


He 


He 


Gly 


Asp 


He 


Arg 


Gin 


Ala 


His 


Cys 


Asn 


He 


Ser 


Arg 


Thr 




290 










295 










300 










Asn 


Trp 


Thr 


Asn 


Thr 


Leu 


Lys 


Arg 


Val 


Ala 


Glu 


Lys 


Leu 


Arg 


Glu 


Lys 


305 










310 










315 










320 


•Ffcre 


Asn 


Asn 


Thr 


Thr 


He 


Val 


Phe 


Asrr-Gln 


Ser 


Ser 


Gly 


Gly 


Asp 


Pro 










325 










330 










335 




Glu 


He 


Val 


Met 


His 


Ser 


Phe 


Asn 


Cys 


Gly 


Gly 


Glu 


Phe 


Phe 


Tyr 


Cys 








340 










345 










350 






Asn 


Thr 


Thr 


Gin 


Leu 


Phe 


Asn 


Ser 


Thr 


Trp 


Asn 


Glu 


Thr 


Asn 


Ser 


Glu 






355 










360 










365 








Gly 


Asn 


He 


Thr 


Ser 


Gly 


Thr 


He 


Thr 


Leu 


Pro 


Cys 


Arg 


He 


Lys 


Gin 




370 










375 










380 










He 


He 


Asn 


Met 


Trp 


Gin 


Glu 


Val 


Gly 


Lys 


Ala 


Met 


Tyr 


Ala 


Pro 


Pro 


385 










390 










395 










400 


He 


Gly 


Gly 


Gin 


He 


Lys 


Cys 


Leu 


Ser 


Asn 


He 


Thr 


Gly 


Leu 


Leu 


Leu 










405 










410 










415 




Thr 


Arg 


Asp 


Gly 


Gly 


Ser 


Asp 


Asn 


Ser 

















420 425 

<210> 47 

<211> 1277 

<212> DNA 

<213> Artificial Sequence 

<220> 
<221> CDS 
<222> (1) . . . (1277) 

<400> 47 
get age gcg gec gac cgc ctg 
Ala Ser Ala Ala Asp Arg Leu 
1 5 

gtg tgg aag gac gec acc acc 
Val Trp Lys Asp Ala Thr Thr 
20 

gec tac gac acc gag gtg cac 
Ala Tyr Asp Thr Glu Val His 
35 

ccc acc gac ccc aac ccc cag 
Pro Thr Asp Pro Asn Pro Gin 
50 55 

aac ttc aac atg ggc aag aac 
Asn Phe Asn Met Gly Lys Asn 
65 70 



tgg gtg acc gtg tac tac ggc gtg ccc 48 
Trp Val Thr Val Tyr Tyr Gly Val Pro 
10 15 

acc ctg ttc tgc gec age gac gec aag 96 
Thr Leu Phe Cys Ala Ser Asp Ala Lys 
25 30 

aac gtg tgg gec acc cac gcg tgc gtg 14 4 

Asn Val Trp Ala Thr His Ala Cys Val 
40 45 

gag gtg gtg ctg ggc aac gtg acc gag 192 
Glu Val Val Leu Gly Asn Val Thr Glu 
60 

aac atg gtg gag cag atg cac gag gat 240 
Asn Met Val Glu Gin Met His Glu Asp 
75 80 



ate ate age ctg tgg gac cag age ctg aag ccc tgc gtg aag ctg acc 288 
He He Ser Leu Trp Asp Gin Ser Leu Lys Pro Cys Val Lys Leu Thr 
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85 90 95 

ccc ctg tgc gtg acc ctg caa tgc acc aag ctg aag cag age acc gac 336 

Pro Leu Cys Val Thr Leu Gin Cys Thr Lys Leu Lys Gin Ser Thr Asp 

100 105 110 

acc cag aac acc cgc tgg ggc acc cag gag atg aag aac tgc age ttc 384 

Thr Gin Asn Thr Arg Trp Gly Thr Gin Glu Met Lys Asn Cys Ser Phe 
115 120 125 

aac ate age acc age gtg cgc aac aag atg aag cgc gag tac gee ctg 432 

Asn lie Ser Thr Ser Val Arg Asn Lys Met Lys Arg Glu Tyr Ala Leu 
130 135 140 

ttc tac age ctg gac ate gtg ccc ate gac aac gac aac acc age tac 480 

Phe Tyr Ser Leu Asp lie Val Pro lie Asp Asn Asp Asn Thr Ser Tyr 

145 150 155 160 

cgc ctg cgc age tgc aac aca teg ate ate acc cag gee tgc ccc aag 528 

Arg Leu Arg Ser Cys Asn Thr Ser lie lie Thr Gin Ala Cys Pro Lys 

165 170 175 

gtg age ttc gag ccc ate ccc ate cac ttc tgc gee ccc gee ggc ttc 576 

Val Ser Phe Glu Pro lie Pro lie His Phe Cys Ala Pro Ala Gly Phe 

180 185 190 

gee ate ctg aag tgc aac aac aag acc ttc aac ggc acc ggc ccc tgc 624 

Ala lie Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys 
195 200 205 

acc aac gtg age acc gtg cag tgc acc cac gga att cgc ccc gtg gtg 672 

Thr Asn Val Ser Thr Val Gin Cys Thr His Gly lie Arg Pro Val Val 
210 215 220 

age acc cag ctg ctg ctg aac ggc age ctg gec gag gag gag gtg gtg 720 

Ser Thr Gin Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val 

225 230 235 240 

ate aga tct gag aac ttc acc aac aac gee aag acc ate ate gtg cag 768 

lie Arg Ser Glu Asn Phe Thr Asn Asn Ala Lys Thr lie lie Val Gin 

245 250 " 255 

ctg aac gag age gtg gag ate aac tgc acc cgc ccc aac aac aac acc 816 

Leu Asn Glu Ser Val Glu lie Asn Cys Thr Arg Pro Asn Asn Asn Thr 

260 265 270 

cgc aag age ate cac ate ggc cct ggc cgc gee ttc tac acc acc ggc 864 

Arg Lys Ser lie His lie Gly Pro Gly Arg Ala Phe Tyr Thr Thr Gly 
275 " 280 285 

gac ate ate ggc gac ate cgc cag gee cac tgc aac ate tct aga acc 912 

Asp lie lie Gly Asp lie Arg Gin Ala His Cys Asn lie Ser Arg Thr 
290 295 300 

aac tgg acc aac acc ctg aag cgc gtg gee gag aag ctg cgc gag aag 960 

Asn Trp Thr Asn Thr Leu Lys Arg Val Ala Glu Lys Leu Arg Glu Lys 

305 310 " 315 320 



ttc aac aac acc acc ate gtg ttc aac cag age tec ggc ggc gac ccc 



1008 
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Phe Asn Asn Thr Thr lie Val Phe Asn Gin Ser Ser Gly Gly Asp Pro 
325 330 335 

gag ate gtg atg cac age ttc aac tgc ggc ggc gag ttc ttc tac tgc 1056 

Glu lie Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys 
340 345 " 350 

aac acc ace cag ctg ttc aac age ace tgg aac gag ace aac age gag 1104 

Asn Thr Thr Gin Leu Phe Asn Ser Thr Trp Asn Glu Thr Asn Ser Glu 
355 360 365 

ggc aac ate act agt ggc acc ate acc ctg ccc tgc cgc ate aag cag 1152 

"^ty Asn lie Thr Ser Gly Thr lie Thr— Leu Pro Cys Arg lie Lys Gin 
370 375 380 

ate ate aac atg tgg cag gag gtg ggc aag gee atg tac gec ccc ccc 1200 

lie He Asn Met Trp Gin Glu Val Gly Lys Ala Met Tyr Ala Pro Pro 

385 390 395 400 

ate ggc ggc cag ate aag tgc ctg age aac ate acc ggc ctg ctg ctg 1248 

lie Gly Gly Gin lie Lys Cys Leu Ser Asn He Thr Gly Leu Leu Leu 
405 410 415 

acc cgc gac ggc ggc age gac aac teg ag 1277 

Thr Arg Asp Gly Gly Ser Asp Asn Ser 
420 * 425 



<210> 48 
<211> 425 
<212> PRT 

<213> Artificial Sequence 
<400> 48 



Ala 


Ser 


Ala 


Ala 


Asp 


Arg 


Leu 


Trp 


Val 


Thr 


Val 


Tyr 


Tyr 


Gly 


Val 


Pro 


1 








5 










10 










15 




Val 


Trp 


Lys 


Asp 


Ala 


Thr 


Thr 


Thr 


Leu 


Phe 


Cys 


Ala 


Ser 


Asp 


Ala 


Lys 








20 










25 










30 






Ala 


Tyr 


Asp 


Thr 


Glu 


Val 


His 


Asn 


Val 


Trp 


Ala 


Thr 


His 


Ala 


Cys 


Val 






35 










40 










45 








Pro 


Thr 


Asp 


Pro 


Asn 


Pro 


Gin 


Glu 


Val 


Val 


Leu 


Gly Asn 


Val 


Thr 


Glu 




50 










55 










60 










Asn 


Phe 


Asn 


Met 


Gly 


Lys 


Asn 


Asn 


Met 


Val 


Glu 


Gin 


Met 


His 


Glu 


Asp 


65 










70 










75 










80 


He 


He 


Ser 


Leu 


Trp 


Asp 


Gin 


Ser 


Leu 


Lys 


Pro 


Cys 


Val 


Lys 


Leu 


Thr 










85 










90 










95 




Pro 


Leu 


Cys 


Val 


Thr 


Leu 


Gin 


Cys 


Thr 


Lys 


Leu 


Lys 


Gin 


Ser 


Thr 


Asp 








100 










105 










110 






Thr 


Gin 


Asn 


Thr 


Arg 


Trp 


Gly 


Thr 


Gin 


Glu 


Met 


Lys 


Asn 


Cys 


Ser 


Phe 






115 










120 










125 








Asn 


He 


Ser 


Thr 


Ser 


Val 


Arg 


Asn 


Lys 


Met 


Lys 


Arg 


Glu 


Tyr 


Ala 


Leu 




130 










135 










140 










Phe 


Tyr 


Ser 


Leu 


Asp 


He 


Val 


Pro 


He 


Asp 


Asn 


Asp 


Asn 


Thr 


Ser 


Tyr 


145 










150 










155 










160 


Arg 


Leu 


Arg 


Ser 


Cys 


Asn 


Thr 


Ser 


He 


He 


Thr 


Gin 


Ala 


Cys 


Pro 


Lys 










165 










170 










175 




Val 


Ser 


Phe 


Glu 


Pro 


He 


Pro 


He 


His 


Phe 


Cys 


Ala 


Pro 


Ala 


Gly 


Phe 








180 










185 










190 






Ala 


He 


Leu 


Lys 


Cys 


Asn 


Asn 


Lys 


Thr 


Phe 


Asn 


Gly 


Thr 


Gly 


Pro 


Cys 
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195 










200 










205 








Thr 


Asn 


Val 


Ser 


Thr 


Val 


Gin 


Cys 


Thr 


His 


Gly 


He 


Arg 


Pro 


Val 


Val 




210 










215 










220 










Ser 


Thr 


Gin 


Leu 


Leu 


Leu 


Asn 


Gly 


Ser 


Leu 


Ala 


Glu 


Glu 


Glu 


Val 


Val 


225 










230 










235 










240 


He 


Arg 


Ser 


Glu 


Asn 


Phe 


Thr 


Asn 


Asn 


Ala 


Lys 


Thr 


He 


He 


Val 


Gin 










245 










250 










255 




Leu 


Asn 


Glu 


Ser 


Val 


Glu 


He 


Asn 


Cys 


Thr 


Arg 


Pro 


Asn 


Asn 


Asn 


Thr 








260 










265 










270 






Arg 


Lys 


Ser 


He 


His 


He 


Gly 


Pro 


Gly Arg 


Ala 


Phe 


Tyr 


Thr 


Thr 


Gly 






275 










280 










285 








Asp 


He 


He 


Gly 


Asp 


He 


Arg 


Gin 


Ala 


His 


Cys 


Asn 


lie 


Ser 


Arg 


Thr 




290 










295 










300 










Asn 


Trp 


Thr 


Asn 


Thr 


Leu 


Lys 


Arg 


Val 


Ala 


Glu 


Lys 


Leu 


Arg 


Glu 


Lys 


305 










310 










315 










320 


Phe 


Asn 


Asn 


Thr 


Thr 


He 


Val 


Phe 


Asn 


Gin 


Ser 


Ser 


Gly 


Gly Asp 


Pro 










325 










330 










335 




Glu 


He 


Val 


Met 


His 


Ser 


Phe 


Asn 


Cys 


Gly 


Gly 


Glu 


Phe 


Phe 


Tyr 


Cys 








340 










345 










350 






Asn 


Thr 


Thr 


Gin 


Leu 


Phe 


Asn 


Ser 


Thr 


Trp 


Asn 


Glu 


Thr 


Asn 


Ser 


Glu 






355 










360 










365 








Gly 


Asn 


He 


Thr 


Ser 


Gly 


Thr 


He 


Thr 


Leu 


Pro 


Cys 


Arg 


He 


Lys 


Gin 




370 










375 










380 










He 


He 


Asn 


Met 


Trp 


Gin 


Glu 


Val 


Gly 


Lys 


Ala 


Met 


Tyr 


Ala 


Pro 


Pro 


385 










390 










395 










400 


He 


Gly 


Gly 


Gin 


He 


Lys 


Cys 


Leu 


Ser 


Asn 


He 


Thr 


Gly 


Leu 


Leu 


Leu 










405 










410 










415 




Thr 


Arg 


Asp 


Gly 


Gly 


Ser 


Asp 


Asn 


Ser 

















420 425 



<210> 49 
<211> 144 
<212> PRT 

<213> Artificial Sequence 



<400> 49 

get age gcg gec gac cgc ctg tgg gtg acc gtg tac tac ggc gtg ccc 48 

gtg tgg aag gac gec acc acc acc ctg ttc tgc gec age gac gec aag 96 

gec tac gac acc gag gtg cac aac gtg tgg gec acc cac gcg tgc gtg 144 

ccc acc gac ccc aac ccc cag gag gtg gtg ctg ggc aac gtg acc gag 192 

aac ttc aac atg ggc aag aac aac atg gtg gag cag atg cac gag gat 240 

ate ate age ctg tgg gac cag age ctg aag ccc tgc gtg aag ctg acc 288 

ccc ctg tgc gtg acc ctg caa tgc acc aag ctg aag cag age acc gac 336 

acc cag aac acc cgc tgg ggc acc cag gag atg aag aac tgc age ttc 384 

cag ate age acc age gtg cgc aac aag atg aag cgc gag tac gec ctg 432 

ttc tac age ctg gac ate gtg ccc ate gac aac gac cag acc age tac 480 

cgc ctg cgc age tgc aac aca teg ate ate acc cag gec tgc ccc aag 528 

gtg age ttc gag ccc ate ccc ate cac ttc tgc gec ccc gec ggc ttc 576 

gee ate ctg aag tgc aac aac aag acc ttc aac ggc acc ggc ccc tgc 624 

acc aac gtg age acc gtg cag tgc acc cac gga att cgc ccc gtg gtg 672 

age acc cag ctg ctg ctg aac ggc age ctg gee gag gag gag gtg gtg 720 

ate aga tct gag aac ttc acc aac aac gee aag acc ate ate gtg cag 768 

ctg aac gag age gtg gag ate aac tgc acc cgc ccc aac aac aac acc 816 

cgc aag age ate cac ate ggc cct ggc cgc gee ttc tac acc acc ggc 864 

gac ate ate ggc gac ate cgc cag gec cac tgc aac ate tct aga acc 912 

aac tgg acc aac acc ctg aag cgc gtg gec gag aag ctg cgc gag aag 960 

ttc aac aac acc acc ate gtg ttc aac cag age tec ggc ggc gac ccc 1008 

gag ate gtg atg cac age ttc aac tgc ggc ggc gag ttc ttc tac tgc 1056 

aac acc acc cag ctg ttc aac age acc tgg aac gag acc aac age gag 1104 
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ggc aac ate 
ate ate aac 
ate ggc ggc 
acc cgc gac 



act agt ggc 
atg tgg cag 
cag ate aag 
ggc ggc age 



acc ate acc 
gag gtg ggc 
tgc ctg age 
gac aac teg 



ctg ccc tgc 
aag gee atg 
aac ate acc 
ag 



cgc ate aag 
tac gee ccc 
ggc ctg ctg 



cag 1152 
ccc 1200 
ctg 1248 
1277 



<210> 50 
<211> 425 
<212> PRT 

<213> Artificial Sequence 



<400> 50 
Ala Ser Ala Ala 
-IT 

Val Trp Lys Asp 
20 

Ala Tyr Asp Thr 
35 

Pro Thr Asp Pro 
50 

Asn Phe Asn Met 
65 

lie lie Ser Leu 

Pro Leu Cys Val 
100 

Thr Gin Asn Thr 
115 

Gin lie Ser Thr 
130 

Phe Tyr Ser Leu 
145 

Arg Leu Arg Ser 

Val Ser Phe Glu 
180 

Ala lie Leu Lys 
195 

Thr Asn Val Ser 
210 

Ser Thr Gin Leu 
225 

lie Arg Ser Glu 

Leu Asn Glu Ser 
260 

Arg Lys Ser lie 
275 

Asp lie lie Gly 
290 

Asn Trp Thr Asn 
305 

Phe Asn Asn Thr 

Glu lie Val Met 
340 

Asn Thr Thr Gin 
355 

Gly Asn lie Thr 
370 



Asp Arg Leu Trp 
5 

Ala Thr Thr Thr 

Glu Val His Asn 

40 

Asn Pro Gin Glu 
55 

Gly Lys Asn Asn 
70 

Trp Asp Gin Ser 
85 

Thr Leu Gin Cys 

Arg Trp Gly Thr 
120 

Ser Val Arg Asn 
135 

Asp lie Val Pro 
150 

Cys Asn Thr Ser 
165 

Pro lie Pro lie 

Cys Asn Asn Lys 
200 

Thr Val Gin Cys 
215 

Leu Leu Asn Gly 
230 

Asn Phe Thr Asn 
245 

Val Glu lie Asn 

His lie Gly Pro 
280 

Asp lie Arg Gin 
295 

Thr Leu Lys Arg 
310 

Thr lie Val Phe 

325 

His Ser Phe Asn 

Leu Phe Asn Ser 
360 

Ser Gly Thr lie 
375 



Val 


Thr 


Val 


Tyr 




10" 






Leu 


Phe 


Cys 


Ala 


25 








Val 


Trp 


Ala 


Thr 


v ai 


val 


Leu 


*o J. y 








60 


Met 


Val 


Glu 


Gin 






75 




Leu 


Lys 


Pro 


Cys 




90 






Thr 


Lys 


Leu 


Lys 


105 








Gin 


Glu 


Met 


Lys 


Lys 


Met 


Lys 


Arg 








140 


He 


Asp 


Asn 


Asp 






155 




He 


He 


Thr 


Gin 




170 






His 


Phe 


Cys 


Ala 


185 








Thr 


Phe 


Asn 


Gly 


Thr 


His 


Glv 


He 








220 


Ser 


Leu 


Ala 


Glu 






235 




Asn 


Ala 


Lys 


Thr 




250 






Cys 


Thr 


Arg 


Pro 


265 








Gly 


Arg 


Ala 


Phe 


Ala 


His 


Cys 


Asn 








300 


Val 


Ala 


Glu 


Lys 






315 




Asn 


Gin 


Ser 


Ser 




330 






Cys 


Gly 


Gly 


Glu 


345 








Thr 


Trp 


Asn 


Glu 


Thr 


Leu 


Pro 


Cys 



380 



Tyr Gly Val Pro 
15 

Ser Asp Ala Lys 
30 

His Ala Cys Val 
45 

Asn Val Thr Glu 

Met His Glu Asp 
80 

Val Lys Leu Thr 
95 

Gin Ser Thr Asp 
110 

Asn Cys Ser Phe 
125 

Glu Tyr Ala Leu 

Gin Thr Ser Tyr 
160 

Ala Cys Pro Lys 
175 

Pro Ala Gly Phe 
190 

Thr Gly Pro Cys 
205 

Arg Pro Val Val 

Glu Glu Val Val 
240 

He He Val Gin 
255 

Asn Asn Asn Thr 
270 

Tyr Thr Thr Gly 
285 

He Ser Arg Thr 

Leu Arg Glu Lys 
320 

Gly Gly Asp Pro 
335 

Phe Phe Tyr Cys 
350 

Thr Asn Ser Glu 
365 

Arg He Lys Gin 
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lie lie Asn Met Trp Gin Glu Val Gly Lys Ala Met Tyr Ala Pro Pro 
385 390 395 400 

lie Gly Gly Gin lie Lys Cys Leu Ser Asn lie Thr Gly Leu Leu Leu 

405 410 415 

Thr Arg Asp Gly Gly Ser Asp Asn Ser 
420 425 



<210> 51 
<211> 144 
<212> PRT 

<213> Artificial Sequence 





<400> 


51 




























get 


age 


gcg 


gec 


gac 


cgc 


ctg 


tgg 


gtg 


acc 


gtg 


tac 


tac 


ggc 


gtg 


ccc 


48 


gtg 


tgg 


aag 


gac 


gec 


acc 


acc 


acc 


ctg 


ttc 


tgc 


gee 


age 


gac 


gec 


aag 


96 


gec 


tac 


gac 


acc 


gag 


gtg 


cac 


aac 


gtg 


tgg 


gec 


acc 


cac 


gcg 


tgc 


gtg 


144 


ccc 


acc 


gac 


ccc 


aac 


ccc 


cag 


gag 


gtg 


gtg 


ctg 


ggc 


aac 


gtg 


acc 


gag 


192 


aac 


ttc 


aac 


atg 


ggc 


aag 


aac 


aac 


atg 


gtg 


g a g 


cag 


atg 


cac 


gag 


gat 


240 


ate 


ate 


age 


ctg 


tgg 


gac 


cag 


age 


ctg 


aag 


ccc 


tgc 


gtg 


aag 


ctg 


acc 


288 


ccc 


ctg 


tgc 


gtg 


acc 


ctg 


aac 


tgc 


acc 


aag 


ctg 


aag 


aac 


age 


acc 


gac 


336 


acc 


aac 


aac 


acc 


cgc 


tgg 


ggc 


acc 


cag 


gag 


atg 


aag 


aac 


tgc 


age 


ttc 


O O /I 


cag 


ate 


age 


acc 


age 


gtg 


cgc 


aac 


aag 


atg 


aag 


cgc 


gag 


tac 


gec 


ctg 


432 


ttc 


tac 


age 


ctg 


gac 


ate 


gtg 


ccc 


ate 


gac 


aac 


gac 


cag 


acc 


age 


tac 


480 


cgc 


ctg 


cgc 


age 


tgc 


aac 


aca 


teg 


ate 


ate 


acc 


cag 


gec 


tgc 


ccc 


aag 


528 


gtg 


age 


ttc 


gag 


ccc 


ate 


ccc 


ate 


cac 


ttc 


tgc 


gee 


ccc 


gec 


ggc 


ttc 


576 


gec 


ate 


ctg 


aag 


tgc 


aac 


aac 


aag 


acc 


ttc 


aac 


ggc 


acc 


ggc 


ccc 


tgc 


t>Z4 


acc 


aac 


gtg 


age 


acc 


gtg 


cag 


tgc 


acc 


cac 


gga 


att 


cgc 


ccc 


gtg 


gtg 


672 


age 


acc 


cag 


ctg 


ctg 


ctg 


aac 


ggc 


age 


ctg 


gee 


gag 


gag 


gag 


gtg 


gtg 


720 


ate 


aga 


tct 


gag 


aac 


ttc 


acc 


aac 


aac 


gec 


aag 


acc 


ate 


ate 


gtg 


cag 


768 


ctg 


aac 


gag 


age 


gtg 


gag 


ate 


aac 


tgc 


acc 


cgc 


ccc 


aac 


aac 


aac 


acc 


816 


cgc 


aag 


age 


ate 


cac 


ate 


ggc 


cct 


ggc 


cgc 


gee 


ttc 


tac 


acc 


acc 


ggc 


864 


gac 


ate 


ate 


ggc 


gac 


ate 


cgc 


cag 


gec 


cac 


tgc 


aac 


ate 


tct 


aga 


acc 


912 


aac 


tgg 


acc 


aac 


acc 


ctg 


aag 


cgc 


gtg 


gec 


gag 


aag 


ctg 


cgc 


gag 


aag 


960 


ttc 


aac 


aac 


acc 


acc 


ate 


gtg 


ttc 


aac 


cag 


age 


tec 


ggc 


ggc 


gac 


ccc 


1008 


gag 


ate 


gtg 


atg 


cac 


age 


ttc 


aac 


tgc 


ggc 


ggc 


gag 


ttc 


ttc 


tac 


tgc 


1056 


aac 


acc 


acc 


cag 


ctg 


ttc 


aac 


age 


acc 


tgg 


aac 


gag 


acc 


aac 


age 


gag 


1104 


ggc 


aac 


ate 


act 


agt 


ggc 


acc 


ate 


acc 


ctg 


ccc 


tgc 


cgc 


ate 


aag 


cag 


1152 


ate 


ate 


aac 


atg 


tgg 


cag 


gag 


gtg 


ggc 


aag 


gec 


atg 


tac 


gec 


ccc 


ccc 


1200 


ate 


ggc 


ggc 


cag 


ate 


aag 


tgc 


ctg 


age 


aac 


ate 


acc 


ggc 


ctg 


ctg 


ctg 


1248 


acc 


cgc 


gac 


ggc 


ggc 


age 


gac 


aac 


teg 


ag 














1277 




<210> 


52 






























<211> 


425 






























<212> 


PRT 






























<213> 


Artificial Sequence 






















<400> 


52 




























Ala 


Ser 


Ala 


Ala 


Asp 


Arg 


Leu 


Trp 


Val 


Thr 


Val 


Tyr 


Tyr 


Gly 


Val 


Pro 




1 








5 










10 










15 






Val 


Trp 


Lys 


Asp 


Ala 


Thr 


Thr 


Thr 


Leu 


Phe 


Cys 


Ala 


Ser 


Asp 


Ala 


Lys 










20 










25 










30 








Ala 


Tyr 


Asp 


Thr 


Glu 


Val 


His 


Asn 


Val 


Trp 


Ala 


Thr 


His 


Ala 


Cys 


Val 








35 










40 










45 










Pro 


Thr 


Asp 


Pro 


Asn 


Pro 


Gin 


Glu 


Val 


Val 


Leu 


Gly 


Asn 


Val 


Thr 


Glu 






50 










55 










60 












Asn 


Phe 


Asn 


Met 


Gly 


Lys 


Asn 


Asn 


Met 


Val 


Glu 


Gin 


Met 


His 


Glu 


Asp 




65 










70 










75 










80 




He 


He 


Ser 


Leu 


Trp 


Asp 


Gin 


Ser 


Leu 


Lys 


Pro 


Cys 


Val 


Lys 


Leu 


Thr 





85 90 95 
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Pro 


Leu 


Cys 


Val 


Thr 


Leu 


Asn 


Cys 


Thr 


Lys 


Leu 


Lys 


Asn 


Ser 


Thr 


Asp 








100 










105 










110 






Thr 


Asn 


Asn 


Thr 


Arg 


Trp 


Gly 


Thr 


Gin 


Glu 


Met 


Lys 


Asn 


Cys 


Ser 


Phe 






115 










120 










125 








Gin 


He 


Ser 


Thr 


Ser 


Val 


Arg 


Asn 


Lys 


Met 


Lys 


Arg 


Glu 


Tyr 


Ala 


Leu 




130 










135 










140 










Phe 


Tyr 


Ser 


Leu 


Asp 


He 


Val 


Pro 


He 


Asp 


Asn 


Asp 


Gin 


Thr 


Ser 


Tyr 


145 










150 










155 










160 


Arg 


Leu Arg 


Ser 


Cys 


Asn 


Thr 


Ser 


He 


He 


Thr 


Gin 


Ala 


Cys 


Pro 


Lys 










165 










170 










175 




Val 


Ser 


Phe 


Glu 


Pro 


He 


Pro 


He 


His 


Phe 


Cys 


Ala 


Pro 


Ala 


Gly 


Phe 








180 










185 










190 






fdra 


He 


Leu 


Lys 


Cys 


Asn 


Asn 


Lys 


Thr- 


-Phe 


Asn 


Gly 


Thr 


Gly 


Pro 


Cys 






195 










200 










205 








Thr 


Asn 


Val 


Ser 


Thr 


Val 


Gin 


Cys 


Thr 


His 


Gly 


He 


Arg 


Pro 


Val 


Val 




210 










215 










220 










Ser 


Thr 


Gin 


Leu 


Leu 


Leu 


Asn 


Gly 


Ser 


Leu 


Ala 


Glu 


Glu 


Glu 


Val 


Val 


225 










230 










235 










240 


lie 


Arg 


Ser 


Glu 


Asn 


Phe 


Thr 


Asn 


Asn 


Ala 


Lys 


Thr 


He 


He 


Val 


Gin 










245 










250 










255 




Leu 


Asn 


Glu 


Ser 


Val 


Glu 


He 


Asn 


Cys 


Thr 


Arg 


Pro 


Asn 


Asn 


Asn 


Thr 








260 










265 










270 






Arg 


Lys 


Ser 


He 


His 


He 


Gly 


Pro 


Gly Arg Ala 


Phe 


Tyr 


Thr 


Thr 


Gly 






275 










280 










285 






Asp 


He 


He 


Gly 


Asp 


He 


Arg 


Gin 


Ala 


His 


Cys 


Asn 


He 


Ser 


Arg 


Thr 




2 90 










295 










300 








Asn 


Trp 


Thr 


Asn 


Thr 


Leu 


Lys 


Arg 


Val 


Ala 


Glu 


Lys 


Leu 


Arg 


Glu 


Lys 


305 










310 










315 










320 


Phe 


Asn 


Asn 


Thr 


Thr 


He 


Val 


Phe 


Asn 


Gin 


Ser 


Ser 


Gly Gly Asp 


Pro 










325 










330 










335 




Glu 


He 


Val 


Met 


His 


Ser 


Phe 


Asn 


Cys 


Gly 


Gly 


Glu 


Phe 


Phe 


Tyr 


Cys 








340 










345 










350 






Asn 


Thr 


Thr 


Gin 


Leu 


Phe 


Asn 


Ser 


Thr 


Trp 


Asn 


Glu 


Thr 


Asn 


Ser 


Glu 






355 




















365 








Gly Asn 


He 


Thr 


Ser 


Gly 


Thr 


He 


Thr 


Leu 


Pro 


Cys 


Arg 


He 


Lys 


Gin 




370 










375 










380 










lie 


He 


Asn 


Met 


Trp 


Gin 


Glu 


Val 


Gly 


Lys 


Ala 


Met 


Tyr 


Ala 


Pro 


Pro 


385 










390 










395 










400 


He 


Gly 


Gly 


Gin 


He 


Lys 


Cys 


Leu 


Ser 


Asn 


He 


Thr 


Gly 


Leu 


Leu 


Leu 










405 










410 










415 




Thr 


Arg Asp 


Gly 


Gly 


Ser 


Asp 


Asn 


Ser 

















420 425 

<210> 53 
<211> 432 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (432) 

<400> 53 

tct aga gcg eta cct cca gga cca gcg ctt cct ggg cat gtg ggg ctg 4 8 

Ser Arg Ala Leu Pro Pro Gly Pro Ala Leu Pro Gly His Val Gly Leu 
15 10 15 

etc egg caa get gat ctg cac cac ggc cgt gec ctg gaa cgc cag ctg 96 
Leu Arg Gin Ala Asp Leu His His Gly Arg Ala Leu Glu Arg Gin Leu 
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20 25 30 

gag caa caa gaa cct gag cca gat ttg gga caa cat gac ctg gat gga 144 

Glu Gin Gin Glu Pro Glu Pro Asp Leu Gly Gin His Asp Leu Asp Gly 
35 40 45 

gtg gga gcg cga gat cag caa eta cac cga gat cat eta cag cct gat 192 

Val Gly Ala Arg Asp Gin Gin Leu His Arg Asp His Leu Gin Pro Asp 
50 55 60 

cga gga gag cca gaa cca gca gga gaa gaa cga get gga cct get cca 240 

Arg Gly Glu Pro Glu Pro Ala Gly Glu Glu Arg Ala Gly Pro Ala Pro 
~T5 70 75 80 

get gga caa gtg ggc aag ctt gtg gaa ctg gtt caa cat cac caa ctg 288 

Ala Gly Gin Val Gly Lys Leu Val Glu Leu Val Gin His His Gin Leu 
85 90 95 

get gtg gta cat caa gat ttt cat cat gat cgt ggg egg cct gat egg 336 

Ala Val Val His Gin Asp Phe His His Asp Arg Gly Arg Pro Asp Arg 
100 105 110 



cct gcg cat cgt gtt cac cgt get gag cat cgt gaa ccg cgt gcg cca 384 

Pro Ala His Arg Val His Arg Ala Glu His Arg Glu Pro Arg Ala Pro 

115 120 125 

ggg eta cag ccc cct gag ctt cca gac ccg cct gec cgt gee ccg egg 432 

Gly Leu Gin Pro Pro Glu Leu Pro Asp Pro Pro Ala Arg Ala Pro Arg 

130 135 140 



<210> 54 
<211> 144 
<212> PRT 

<213> Artificial Sequence 



<400> 54 



Ser 


Arg 


Ala 


Leu 


Pro 


Pro 


Gly 


Pro 


Ala 


Leu 


Pro 


Gly 


His 


Val 


Gly 


Leu 


1 








5 










10 










15 




Leu 


Arg 


Gin 


Ala 


Asp 


Leu 


His 


His 


Gly Arg 


Ala 


Leu 


Glu 


Arg 


Gin 


Leu 








20 










25 










30 






Glu 


Gin 


Gin 
35 


Glu 


Pro 


Glu 


Pro 


Asp 
40 


Leu 


Gly 


Gin 


His 


Asp 
45 


Leu 


Asp 


Gly 


Val 


Gly 
50 


Ala 


Arg 


Asp 


Gin 


Gin 
55 


Leu 


His 


Arg 


Asp 


His 
60 


Leu 


Gin 


Pro 


Asp 


Arg 


Gly 


Glu 


Pro 


Glu 


Pro 


Ala 


Gly 


Glu 


Glu 


Arg 


Ala 


Gly 


Pro 


Ala 


Pro 


65 










70 










75 










80 


Ala 


Gly 


Gin 


Val 


Gly 
85 


Lys 


Leu 


Val 


Glu 


Leu 
90 


Val 


Gin 


His 


His 


Gin 
95 


Leu 


Ala 


Val 


Val 


His 


Gin 


Asp 


Phe 


His 


His 


Asp 


Arg 


Gly Arg 


Pro 


Asp 


Arg 








100 










105 










110 






Pro 


Ala 


His 

115 


Arg 


Val 


His 


Arg 


Ala 
120 


Glu 


His 


Arg 


Glu 


Pro 
125 


Arg 


Ala 


Pro 


Gly 


Met 
130 


Gin 


Pro 


Pro 


Glu 


Leu 
135 


Pro 


Asp 


Pro 


Pro 


Ala 
140 


Arg 


Val 


Thr 


Asp 



<210> 55 
<211> 434 
<212> DNA 
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<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (434) 

<400> 55 

tct aga gcg eta cct cca gga cca gcg ctt cct ggg cat gtg ggg ctg 4 8 

Ser Arg Ala Leu Pro Pro Gly Pro Ala Leu Pro Gly His Val Gly Leu 
15 10 15 

etc egg caa get gat ctg cac cac ggc cgt gec ctg gaa cgc cag ctg 96 
■feeu Arg Gin Ala Asp Leu His His Gty— Arg Ala Leu Glu Arg Gin Leu 
20 25 30 

gag caa caa gaa cct gag cca gat ttg gga caa cat gac ctg gat gga 14 4 

Glu Gin Gin Glu Pro Glu Pro Asp Leu Gly Gin His Asp Leu Asp Gly 
35 40 45 

gtg gga gcg cga gat cag caa eta cac cga gat cat eta cag cct gat 192 
Val Gly Ala Arg Asp Gin Gin Leu His Arg Asp His Leu Gin Pro Asp 
50 55 60 

cga gga gag cca gaa cca gca gga gaa gaa cga get gga cct get cca 240 
Arg Gly Glu Pro Glu Pro Ala Gly Glu Glu Arg Ala Gly Pro Ala Pro 
65 70 75 80 

get gga caa gtg ggc aag ctt gtg gaa ctg gtt caa cat cac caa ctg 288 
Ala Gly Gin Val Gly Lys Leu Val Glu Leu Val Gin His His Gin Leu 
85 90 95 



get gtg gta cat caa gat ttt cat cat gat cgt ggg egg cct gat egg 
Ala Val Val His Gin Asp Phe His His Asp Arg Gly Arg Pro Asp Arg 
100 105 110 



336 



cct gcg cat cgt gtt cac cgt get gag cat cgt gaa ccg cgt gcg cca 384 
Pro Ala His Arg Val His Arg Ala Glu His Arg Glu Pro Arg Ala Pro 
115 120 125 

ggg atg cag ccc cct gag ctt cca gac ccg cct gec cgt gtg acg gat 432 
Gly Met Gin Pro Pro Glu Leu Pro Asp Pro Pro Ala Arg Val Thr Asp 
130 135 " . 140 

cc 434 

<210> 56 
<211> 144 
<212> PRT 

<213> Artificial Sequence 
<400> 56 

Ser Arg Ala Leu Pro Pro Gly Pro Ala Leu Pro Gly His Val Gly Leu 

15 10 15 

Leu Arg Gin Ala Asp Leu His His Gly Arg Ala Leu Glu Arg Gin Leu 

20 25 30 

Glu Gin Gin Glu Pro Glu Pro Asp Leu Gly Gin His Asp Leu Asp Gly 

35 4 0 4 5 

Val Gly Ala Arg Asp Gin Gin Leu His Arg Asp His Leu Gin Pro Asp 
50 55 60 
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Arg 


Gly 


Glu 


Pro 


Glu 


Pro 


Ala 


Gly 


Glu 


Glu 


Arg 


Ala 


Gly 


Pro 


Ala 


Pro 


65 










70 










75 










80 






^ j. n 


va jl 


tjxy 


Lys 


Leu 


vai 


CjJ.U 


Leu 


Val 


Gin 


Jril S 


His 


Gin 


Leu 










85 










90 










95 




Ala 


Val 


Val 


His 


Gin 


Asp 


Phe 


His 


His 


Asp 


Arg 


Gly 


Arg 


Pro 


Asp 


Arg 








100 










105 










110 






Pro 


Ala 


His 


Arg 


Val 


His 


Arg 


Ala 


Glu 


His 


Arg 


Glu 


Pro 


Arg 


Ala 


Pro 






115 










120 










IOC. 
_L iL D 








Gly 


Met 


Gin 


Pro 


Pro 


Glu 


Leu 


Pro 


Asp 


Pro 


Pro 


Ala 


Arg 


Val 


Thr 


Asp 




130 










135 










140 












<210> 


57 




























<211> 


281 




























<212> 


DNA 




























<213> 


Artificial Sequence 




















<220> 






























<221> 


CDS 




























<222> 




. . . (281) 
























<400> 


57 


























tct 


aga 


gcg 


eta 


c ct 




gg a 


cca 


gcg 


ctt 


cct 


ggg 


cat 


gtg 


ggg 


ctg 


Ser 


Arg 


Ala 


Leu 


Pro 


Pro 


Gly 


Pro 


Ala 


Leu 


Pro 


Gly 


His 


Val 


Gly 


Leu 


1 








5 










10 










15 




etc 


egg 


caa 


get 


gat 


ctg 


cac 


cac 


ggc 


cgt 


gee 


ctg 


gaa 


cgc 


cag 


ctg 


Leu 


Arg 


Gin 


Ala 


Asp 


Leu 


His 


His 


Gly 


Arg 


Ala 


Leu 


Glu 


Arg 


Gin 


Leu 








20 










25 










30 






gag 


caa 


caa 


gaa 


cct 


gag 


cca 


gat 


ttg 


gga 


caa 


cat 


gac 


ctg 


gat 


gga 


Glu 


Gin 


Gin 


Glu 


Pro 


Glu 


Pro 


Asp 


Leu 


Gly 


Gin 


His 


Asp 


Leu 


Asp 


Gly 






35 










40 










45 








gtg 


gga 


gcg 


cga 


gat 


cag 


caa 


eta 


cac 


cga 


gat 


cat 


eta 


cag 


cct 


gat 


Val 


Gly 


Ala 


Arg 


Asp 


Gin 


Gin 


Leu 


His 


Arg 


Asp 


His 


Leu 


Gin 


Pro 


Asp 




50 










55 










60 










cga 


gga 


gag 


cca 


gaa 


cca 


gca 


gga 


gaa 


gaa 


cga 


get 


gga 


cct 


get 


cca 


Arg 


Gly 


Glu 


Pro 


Glu 


Pro 


Ala 


Gly 


Glu 


Glu 


Arg 


Ala 


Gly 


Pro 


Ala 


Pro 


65 










70 










75 










80 


get 


gga 


caa 


gtg 


ggc 


aag 


ctt 


gtg 


tga 


ctg 


att 


gag 


gat 


cc 






Ala 


Gly 


Gin 


Val 


Gly 


Lys 


Leu 


Val 


*• 


Leu 


j. j_e 


Pin 
O J_U 


Asp 
















85 












90 














<210> 


58 




























<211> 


92 




























<212> 


PRT 




























<213> 


Artificial Sequence 




















<400> 


58 


























Ser 


Arg 


Ala 


Leu 


Pro 


Pro 


Gly 


Pro 


Ala 


Leu 


Pro 


Gly 


His 


Val 


Gly 


Leu 


1 








5 










10 










15 




Leu 


Arg 


Gin 


Ala 


Asp 


Leu 


His 


His 


Gly 


Arg 


Ala 


Leu 


Glu 


Arg 


Gin 


Leu 








20 










25 










30 






Glu 


Gin 


Gin 


Glu 


Pro 


Glu 


Pro 


Asp 


Leu 


Gly 


Gin 


His 


Asp 


Leu 


Asp 


Gly 






35 










40 










45 








Val 


Gly Ala 


Arg 


Asp 


Gin 


Gin 


Leu 


His 


Arg 


Asp 


His 


Leu 


Gin 


Pro 


Asp 



WO 00/29561 



PCT/DK00/00144 



30 



50 55 60 

Arg Gly Glu Pro Glu Pro Ala Gly Glu Glu Arg Ala Gly Pro Ala Pro 
65 70 75 80 

Ala Gly Gin Val Gly Lys Leu Val Leu lie Glu Asp 
85 90 



<210> 59 
<211> 272 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (272) 



<400> 59 

ate gat tgc gcg acc tgc tgc tga teg tgg ccc gca teg tgg age tgc 48 

lie Asp Cys Ala Thr Cys Cys * Ser Trp Pro Ala Ser Trp Ser Cys 
15 10 15 



tgg gee ggc gcg get ggg aga tec tga agt act ggt gga acc tgc tec 96 

Trp Ala Gly Ala Ala Gly Arg Ser * Ser Thr Gly Gly Thr Cys Ser 

20 25 30 

agt act gga gec agg age tga aga act ctg cag tga gee tgc tga acg 144 

Ser Thr Gly Ala Arg Ser * Arg Thr Leu Gin * Ala Cys * Thr 

35 40 



cca ccg cca teg ccg tgg ccg agg gca ccg acc gcg tga teg agg tgg 192 

Pro Pro Pro Ser Pro Trp Pro Arg Ala Pro Thr Ala * Ser Arg Trp 

4 5 50 55 

tgc age gca tct ggc gcg gca tec tgc aca tec cca ccc gaa ttc gee 240 

Cys Ser Ala Ser Gly Ala Ala Ser Cys Thr Ser Pro Pro Glu Phe Ala 

60 65 70 

agg get teg age gcg ccc tgc tgt aag gat cc 272 

Arg Ala Ser Ser Ala Pro Cys Cys Lys Asp 

7 5 80 



<210> 60 
<211> 84 
<212> PRT 

<213> Artificial Sequence 



<400> 60 
lie Asp Cys Ala Thr Cys Cys Ser 

1 5 
Ala Gly Ala Ala Gly Arg Ser Ser 

20 

Gly Ala Arg Ser Arg Thr Leu Gin 

35 40 
Trp Pro Arg Ala Pro Thr Ala Ser 

50 55 
Ala Ser Cys Thr Ser Pro Pro Glu 
65 70 
Cys Cys Lys Asp 



Trp Pro Ala Ser Trp Ser Cys Trp 

10 15 
Thr Gly Gly Thr Cys Ser Ser Thr 
25 30 
Ala Cys Thr Pro Pro Pro Ser Pro 
45 

Arg Trp Cys Ser Ala Ser Gly Ala 
60 

Phe Ala Arg Ala Ser Ser Ala Pro 
75 80 
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<210> 61 

<211> 798 

<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (798) 

<400> 61 

etc gag cag egg caa gga gat ttt ccg ccc egg egg egg cga cat gcg 4 8 

EeTi Glu Gin Arg Gin Giy Asp Phe PrcTTro Arg Arg Arg Arg His Ala 
15 10 15 

cga caa ctg gcg cag cga get gta caa gta caa ggt ggt gaa gat cga 96 

Arg Gin Leu Ala Gin Arg Ala Val Gin Val Gin Giy Giy Glu Asp Arg 
20 25 30 

gee cct ggg cat cgc ccc cac caa ggc caa gcg ccg cgt ggt gca gcg 14 4 

Ala Pro Giy His Arg Pro His Gin Giy Gin Ala Pro Arg Giy Ala Ala 
35 40 45 

cga gaa gcg cgc cgt ggg cat egg cgc tat gtt cct egg ctt cct ggg 192 

Arg Glu Ala Arg Arg Giy His Arg Arg Tyr Val Pro Arg Leu Pro Giy 
50 55 60 

cgc tgc agg cag cac cat ggg cgc cgc cag cct gac cct gac cgt gca 240 

Arg Cys Arg Gin His His Giy Arg Arg Gin Pro Asp Pro Asp Arg Ala 
65 70 75 80 

ggc ccg cca get get gag egg cat cgt gca gca gca gaa caa cct get 28 8 

Giy Pro Pro Ala Ala Glu Arg His Arg Ala Ala Ala Glu Gin Pro Ala 
85 90 95 

gcg cgc cat cga ggc cca gca gca cct get cca get gac cgt gtg ggg 33 6 

Ala Arg His Arg Giy Pro Ala Ala Pro Ala Pro Ala Asp Arg Val Giy 
100 105 110 

cat caa gca get cca ggc ccg cgt get ggc tct aga gcg eta cct cca 384 

His Gin Ala Ala Pro Giy Pro Arg Ala Giy Ser Arg Ala Leu Pro Pro 
115 120 125 

gga cca gcg ctt cct ggg cat gtg ggg ctg etc egg caa get gat ctg 432 

Giy Pro Ala Leu Pro Giy His Val Giy Leu Leu Arg Gin Ala Asp Leu 
130 135 140 

cac cac ggc cgt gee ctg gaa cgc cag ctg gag caa caa gaa cct gag 480 
His His Giy Arg Ala Leu Glu Arg Gin Leu Glu Gin Gin Glu Pro Glu 
145 150 155 160 

cca gat ttg gga caa cat gac ctg gat gga gtg gga gcg cga gat cag 528 

Pro Asp Leu Giy Gin His Asp Leu Asp Giy Val Giy Ala Arg Asp Gin 
165 " 17 0 17 5 



caa eta cac cga gat cat eta cag cct gat cga gga gag cca gaa cca 
Gin Leu His Arg Asp His Leu Gin Pro Asp Arg Giy Glu Pro Glu Pro 
180 185 190 



576 
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gca gga gaa 
Ala Gly Glu 
195 

ctt gtg gaa 
Leu Val Glu 
210 

ttt cat cat 
Phe His His 
225 

cgt get gag 
Arg Ala Glu 



ctt cca gac 
Leu Pro Asp 



gaa cga get 
Glu Arg Ala 



ctg gtt caa 
Leu Val Gin 



gat cgt ggg 
Asp Arg Gly 
230 

cat cgt gaa 
His Arg Glu 
245 

ccg cct gec 
Pro Pro Ala 
260 



gga cct get 
Gly Pro Ala 
200 

cat cac caa 
His His Gin 
215 

egg cct gat 
Arg Pro Asp 



ccg cgt gcg 
Pro Arg Ala 



cgt gec ccg 
Arg Ala Pro 
265 



cca get gga 
Pro Ala Gly 



ctg get gtg 
Leu Ala Val 
220 

egg cct gcg 
Arg Pro Ala 
235 

'cca ggg eta 
Pro Gly Leu 
250 

egg 
Arg 



caa gtg ggc 
Gin Val Gly 
205 

gta cat caa 
Val His Gin 



cat cgt gtt 
His Arg Val 



cag ccc cct 
Gin Pro Pro 
255 



aag 624 
Lys 



gat 672 
Asp 



cac 720 

His 

240 

gag 768 
Glu 



798 



<210> 62 
<211> 266 
<212> PRT 

<213> Artificial Sequence 



<400> 62 



Leu 


Glu 


Gin 


Arg 


Gin 


Gly 


Asp 


Phe 


1 








5 








Arg 


Gin 


Leu 


Ala 


Gin 


Arg 


Ala 


Val 








20 










Ala 


Pro 


Gly 


His 


Arg 


Pro 


His 


Gin 






35 










40 


Arg 


Glu 


Ala 


Arg 


Arg 


Gly 


His 


Arg 




50 










55 




Arg 


Cys 


Arg 


Gin 


His 


His 


Gly 


Arg 


65 










70 






Gly 


Pro 


Pro 


Ala 


Ala 


Glu 


Arg 


His 










85 








Ala 


Arg 


His 


Arg 


Gly 


Pro 


Ala 


Ala 








100 










His 


Gin 


Ala 


Ala 


Pro 


Gly 


Pro 


Arg 






115 










120 


Gly 


Pro 


Ala 


Leu 


Pro 


Gly 


His 


Val 




130 










135 




His 


His 


Gly 


Arg 


Ala 


Leu 


Glu 


Arg 


145 










150 






Pro 


Asp 


Leu 


Gly 


Gin 


His 


Asp 


Leu 










165 








Gin 


Leu 


His 


Arg 


Asp 


His 


Leu 


Gin 








180 










Ala 


Gly 


Glu 


Glu 


Arg 


Ala 


Gly 


Pro 






195 










200 


Leu 


Val 


Glu 


Leu 


Val 


Gin 


His 


His 




210 










215 




Phe 


His 


His 


Asp 


Arg 


Gly 


Arg 


Pro 


225 










230 






Arg 


Ala 


Glu 


His 


Arg 


Glu 


Pro 


Arg 



245 



Pro 


Pro 
10 


Arg 


Arg 


Arg 


Arg 


His 
15 


Ala 


Gin 


Val 


Gin 


Gly 


Gly 


Glu 


Asp 


Arg 


25 










30 






Gly 


Gin 


Ala 


Pro 


Arg 
45 


Gly 


Ala 


Ala 


Arg 


Tyr 


Val 


Pro 
60 


Arg 


Leu 


Pro 


Gly 


Arg 


Gin 


Pro 
75 


Asp 


Pro 


Asp 


Arg 


Ala 
80 


Arg 


Ala 
90 


Ala 


Ala 


Glu 


Gin 


Pro 
95 


Ala 


Pro 


Ala 


Pro 


Ala 


Asp 


Arg 


Val 


Gly 


105 










110 






Ala 


Gly 


Ser 


Arg 


Ala 
125 


Leu 


Pro 


Pro 


Gly 


Leu 


Leu 


Arg 
140 


Gin 


Ala 


Asp 


Leu 


Gin 


Leu 


Glu 
155 


Gin 


Gin 


Glu 


Pro 


Glu 
160 


Asp 


Gly 
170 


Val 


Gly 


Ala 


Arg 


Asp 
175 


Gin 


Pro 


Asp 


Arg 


Gly 


Glu 


Pro 


Glu 


Pro 


185 










190 






Ala 


Pro 


Ala 


Gly 


Gin 
205 


Val 


Gly 


Lys 


Gin 


Leu 


Ala 


Val 
220 


Val 


His 


Gin 


Asp 


Asp 


Arg 


Pro 
235 


Ala 


His 


Arg 


Val 


His 
240 


Ala 


Pro 
250 


Gly 


Leu 


Gin 


Pro 


Pro 
255 


Glu 
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Leu Pro Asp Pro Pro Ala Arg Ala Pro Arg 
260 265 

<210> 63 
<211> 800 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (800) 

<400> 63 

etc gag cag egg caa gga gat ttt ccg ccc egg egg egg cga cat gcg 4 8 

Leu Glu Gin Arg Gin Gly Asp Phe Pro Pro Arg Arg Arg Arg His Ala 

15 10 15 

cga caa ctg gcg cag cga get gta caa gta caa ggt ggt gaa gat cga 96 
Arg Gin Leu Ala Gin Arg Ala Val Gin Val Gin Gly Gly Glu Asp Arg 
20 25 30 

gec cct ggg cat cgc ccc cac caa ggc caa gcg ccg cgt ggt gca gcg 14 4 

Ala Pro Gly His Arg Pro His Gin Gly Gin Ala Pro Arg Gly Ala Ala 
35 40 45 

cga gaa gcg cgc cgt ggg cat egg cgc tat gtt cct egg ctt cct ggg 192 
Arg Glu Ala Arg Arg Gly His Arg Arg Tyr Val Pro Arg Leu Pro Gly 
50 55 60 

cgc tgc agg cag cac cat ggg cgc cgc cag cct gac cct gac cgt gca 240 
Arg Cys Arg Gin His His Gly Arg Arg Gin Pro Asp Pro Asp Arg Ala 
65 70 75 80 

ggc ccg cca get get gag egg cat cgt gca gca gca gaa caa cct get 288 
Gly Pro Pro Ala Ala Glu Arg His Arg Ala Ala Ala Glu Gin Pro Ala 
85 90 95 

gcg cgc cat cga ggc cca gca gca cct get cca get gac cgt gtg ggg 336 
Ala Arg His Arg Gly Pro Ala Ala Pro Ala Pro Ala Asp Arg Val Gly 
100 105 110 

cat caa gca get cca ggc ccg cgt get ggc tct aga gcg eta cct cca 384 
His Gin Ala Ala Pro Gly Pro Arg Ala Gly Ser Arg Ala Leu Pro Pro 
115 120 " 125 

gga cca gcg ctt cct ggg cat gtg ggg ctg etc egg caa get gat ctg 4 32 

Gly Pro Ala Leu Pro Gly His Val Gly Leu Leu Arg Gin Ala Asp Leu 
130 135 140 

cac cac ggc cgt gee ctg gaa cgc cag ctg gag caa caa gaa cct gag 480 
His His Gly Arg Ala Leu Glu Arg Gin Leu Glu Gin Gin Glu Pro Glu 
145 150 155 160 

cca gat ttg gga caa cat gac ctg gat gga gtg gga gcg cga gat cag 528 
Pro Asp Leu Gly Gin His Asp Leu Asp Gly Val Gly Ala Arg Asp Gin 
165 170 ' 175 



caa eta cac cga gat cat eta cag cct gat cga gga gag cca gaa cca 
Gin Leu His Arg Asp His Leu Gin Pro Asp Arg Gly Glu Pro Glu Pro 



576 



WO 00/29561 



PCT/DK00/00144 



34 



180 

gca gga gaa gaa cga get 
Ala Gly Glu Glu Arg Ala 
195 

ctt gtg gaa ctg gtt caa 
Leu Val Glu Leu Val Gin 
210 

ttt cat cat gat cgt ggg 
Phe His His Asp Arg Gly 
T2r5 230 



185 

gga cct get cca get gga 

Gly Pro Ala Pro Ala Gly 
200 

cat cac caa ctg get gtg 

His His Gin Leu Ala Val 

215 220 

egg cct gat egg cct gcg 

Arg Pro Asp Arg Pro Ala 
235 



190 

caa gtg ggc aag 624 

Gin Val Gly Lys 

205 

gta cat caa gat 672 
Val His Gin Asp 



cat cgt gtt cac 720 
His Arg Val His 
240 



cgt get gag cat cgt gaa ccg cgt gcg cca ggg atg cag ccc cct gag 7 68 

Arg Ala Glu His Arg Glu Pro Arg Ala Pro Gly Met Gin Pro Pro Glu 

245 . 250 255 

ctt cca gac ccg cct gee cgt gtg acg gat cc 800 

Leu Pro Asp Pro Pro Ala Arg Val Thr Asp 
260 265 



<210> 64 
<211> 266 
<212> PRT 
<213> Arti 

<400> 64 
Leu Glu Gin Arg 
1 

Arg Gin Leu Ala 
20 

Ala Pro Gly His 
35 

Arg Glu Ala Arg 
50 

Arg Cys Arg Gin 
65 

Gly Pro Pro Ala 

Ala Arg His Arg 
100 

His Gin Ala Ala 
115 

Gly Pro Ala Leu 
130 

His His Gly Arg 
14 5 

Pro Asp Leu Gly 

Gin Leu His Arg 
180 

Ala Gly Glu Glu 
195 

Leu Val Glu Leu 
210 

Phe His His Asp 
225 



ficial Sequence 



Gin 


Gly 


Asp 


Phe 


5 








Gin 


Arg 


Ala 


Val 


Arg 


Pro 


His 


Gin 








40 


Arg 


Gly 


His 


Arg 






55 




His 


His 


Gly 


Arg 




70 






Ala 


Glu 


Arg 


His 


85 








Gly 


Pro 


Ala 


Ala 


Pro 


Gly 


Pro 


Arg 








120 


Pro 


Gly 


His 


Val 






135 




Ala 


Leu 


Glu 


Arg 




150 






Gin 


His 


Asp 


Leu 


165 








Asp 


His 


Leu 


Gin 


Arg 


Ala 


Gly 


Pro 








200 


Val 


Gin 


His 


His 






215 




Arg 


Gly 


Arg 


Pro 




230 







Pro 


Pro 


Arg 


Arg 




10 






Gin 


Val 


Gin 


Gly 


25 








Gly 


Gin 


Ala 


Pro 


Arg 


Tyr 


Val 


Pro 








60 


Arg 


Gin 


Pro 


Asp 






75 




Arg 


Ala 


Ala 


Ala 




90 






Pro 


Ala 


Pro 


Ala 


105 








Ala 


Gly 


Ser 


Arg 


Gly 


Leu 


Leu 


Arg 








140 


Gin 


Leu 


Glu 


Gin 






155 




Asp 


Gly 


Val 


Gly 




170 






Pro 


Asp 


Arg 


Gly 


185 








Ala 


Pro 


Ala 


Gly 


Gin 


Leu 


Ala 


Val 








220 


Asp 


Arg 


Pro 


Ala 






235 





Arg Arg His Ala 
15 

Gly Glu Asp Arg 
30 

Arg Gly Ala Ala 
45 

Arg Leu Pro Gly 

Pro Asp Arg Ala 
80 

Glu Gin Pro Ala 
95 

Asp Arg Val Gly 
110 

Ala Leu Pro Pro 
125 

Gin Ala Asp Leu 

Gin Glu Pro Glu 
160 

Ala Arg Asp Gin 
175 

Glu Pro Glu Pro 
190 

Gin Val Gly Lys 
205 

Val His Gin Asp 

His Arg Val His 
240 
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Arg Ala Glu His Arg Glu Pro Arg Ala Pro Gly Met Gin Pro Pro Glu 

245 250 255 

Leu Pro Asp Pro Pro Ala Arg Val Thr Asp 

260 265 

<210> 65 
<211> 647 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) ... (647) 

<400> 65 

etc gag cag egg caa gga gat ttt ccg ccc egg egg egg cga cat gcg 
Leu Glu Gin Arg Gin Gly Asp Phe Pro Pro Arg Arg Arg Arg His Ala 



1 



5 10 15 



gga 



cca gcg ctt cct ggg cat gtg ggg ctg etc egg caa get gat ctg 
Gly Pro Ala Leu Pro Gly His Val Gly Leu Leu Arg Gin Ala Asp Leu 
130 135 140 



cac cac ggc cgt gee ctg gaa cgc cag ctg gag caa caa gaa cct gag 
His His Gly Arg Ala Leu Glu Arg Gin Leu Glu Gin Gin Glu Pro Glu 
145 * 150 155 160 

cca gat ttg gga caa cat gac ctg gat gga gtg gga gcg cga gat cag 
Pro Asp Leu Gly Gin His Asp Leu Asp Gly Val Gly Ala Arg Asp Gin 
165 170 175 



48 



96 



cga caa ctg gcg cag cga get gta caa gta caa ggt ggt gaa gat cga 
Arg Gin Leu Ala Gin Arg Ala Val Gin Val Gin Gly Gly Glu Asp Arg 
2 0 2 5 30 

gee cct ggg cat cgc ccc cac caa ggc caa gcg ccg cgt ggt gca gcg 144 
Ala Pro Gly His Arg Pro His Gin Gly Gin Ala Pro Arg Gly Ala Ala 
35 40 45 

cga gaa gcg cgc cgt ggg cat egg cgc tat gtt cct egg ctt cct ggg 192 
Arg Glu Ala Arg Arg Gly His Arg Arg Tyr Val Pro Arg Leu Pro Gly 
50 55 60 

cgc tgc agg cag cac cat ggg cgc cgc cag cct gac cct gac cgt gca 
Arg Cys Arg Gin His His Gly Arg Arg Gin Pro Asp Pro Asp Arg Ala 
65 70 75 80 

ggc ccg cca get get gag egg cat cgt gca gca gca gaa caa cct get 
Gly Pro Pro Ala Ala Glu Arg His Arq Ala Ala Ala Glu Gin Pro Ala 
85 90 95 

gcg cgc cat cga ggc cca gca gca cct get cca get gac cgt gtg ggg 
Ala Arg His Arg Gly Pro Ala Ala Pro Ala Pro Ala Asp Arg Val Gly 
100 105 • HO 

cat caa gca get cca ggc ccg cgt get ggc tct aga gcg eta cct cca 384 
His Gin Ala Ala Pro Gly Pro Arg Ala Gly Ser Arg Ala Leu Pro Pro 
115 120 125 



240 



288 



336 



432 



480 



528 
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caa eta cac cga gat cat eta cag cct gat cga gga gag cca gaa cca 576 

Gin Leu His Arg Asp His Leu Gin Pro Asp Arg Gly Glu Pro Glu Pro 
180 185 190 

gca gga gaa gaa cga get gga cct get cca get gga caa gtg ggc aag 624 

Ala Gly Glu Glu Arg Ala Gly Pro Ala Pro Ala Gly Gin Val Gly Lys 
195 200 205 



ctt gtg tga ctg att gag gat cc 
Leu Val * Leu lie Glu Asp 
210 



647 



<210> 66 
<211> 214 
<212> PRT 

<213> Artificial Sequence 



<400> 66 
Leu Glu Gin Arg 
1 

Arg Gin Leu Ala 
20 

Ala Pro Gly His 
35 

Arg Glu Ala Arg 
50 

Arg Cys Arg Gin 
65 

Gly Pro Pro Ala 

Ala Arg His Arg 
100 

His Gin Ala Ala 
115 

Gly Pro Ala Leu 
130 

His His Gly Arg 
145 

Pro Asp Leu Gly 

Gin Leu His Arg 
180 

Ala Gly Glu Glu 
195 

Leu Val Leu lie 
210 



Gin Gly Asp Phe 
5 

Gin Arg Ala Val 

Arg Pro His Gin 
40 

Arg Gly His Arg 
55 

His His Gly Arg 
70 

Ala Glu Arg His 
85 

Gly Pro Ala Ala 

Pro Gly Pro Arg 
120 

Pro Gly His Val 
135 

Ala Leu Glu Arg 
150 

Gin His Asp Leu 
165 

Asp His Leu Gin 

Arg Ala Gly Pro 
200 

Glu Asp 



Pro Pro Arg Arg 
10 

Gin Val Gin Gly 
25 

Gly Gin Ala Pro 

Arg Tyr Val Pro 
60 

Arg Gin Pro Asp 
75 

Arg Ala Ala Ala 
90 

Pro Ala Pro Ala 
105 

Ala Gly Ser Arg 

Gly Leu Leu Arg 
140 

Gin Leu Glu Gin 
155 

Asp Gly Val Gly 
170 

Pro Asp Arg Gly 
185 

Ala Pro Ala Gly 



Arg Arg His Ala 
15 

Gly Glu Asp Arg 
30 

Arg Gly Ala Ala 
45 

Arg Leu Pro Gly 

Pro Asp Arg Ala 
80 

Glu Gin Pro Ala 
95 

Asp Arg Val Gly 
110 

Ala Leu Pro Pro 
125 

Gin Ala Asp Leu 

Gin Glu Pro Glu 
160 

Ala Arg Asp Gin 
175 

Glu Pro Glu Pro 
190 

Gin Val Gly Lys 
205 



<210> 67 
<211> 1918 
<212> DNA 

<213> Artificial Sequence 



<220> 

<221> CDS 

<222> (1) . . . (1918) 



<400> 67 

get age gcg gec gac cgc ctg tgg gtg ace gtg tac tac ggc gtg ccc 



48 
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Ala Ser Ala Ala Asp Arg Leu Trp Val Thr Val Tyr Tyr Gly Val Pro 

15 10 15 

gtg tgg aag gac gcc acc acc acc ctg ttc tgc gcc age gac gec aag 96 

Val Trp Lys Asp Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys 

20 25 30 

gcc tac gac acc gag gtg cac aac gtg tgg gcc acc cac gcg tgc gtg 144 

Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val 

35 40 45 

ccc acc gac ccc aac ccc cag gag gtg gtg ctg ggc aac gtg acc gag 192 

-Pro Thr Asp Pro Asn Pro Gin Glu Varir- Val Leu Gly Asn Val Thr Glu 

50 " 55 60 

aac ttc aac atg ggc aag aac aac atg gtg gag cag atg cac gag gat 240 

Asn Phe Asn Met Gly Lys Asn Asn Met Val Glu Gin Met His Glu Asp 

65 70 75 80 

ate ate age ctg tgg gac cag age ctg aag ccc tgc gtg aag ctg acc 288 

lie lie Ser Leu Trp Asp Gin Ser Leu Lys Pro Cys Val Lys Leu Thr 

85 90 95 

ccc ctg tgc gtg acc ctg aac tgc acc aag ctg aag aac age acc gac 336 

Pro Leu Cys Val Thr Leu Asn Cys Thr Lys Leu Lys Asn Ser Thr Asp 

100 105 110 

acc aac aac acc cgc tgg ggc acc cag gag atg aag aac tgc age ttc 384 

Thr Asn Asn Thr Arg Trp Gly Thr Gin Glu Met Lys Asn Cys Ser Phe 

115 120 125 

aac ate age acc age gtg cgc aac aag atg aag cgc gag tac gcc ctg 4 32 

Asn lie Ser Thr Ser Val Arg Asn Lys Met Lys Arg Glu Tyr Ala Leu 

130 135 140 

ttc tac age ctg gac ate gtg ccc ate gac aac gac aac acc age tac 480 

Phe Tyr Ser Leu Asp lie Val Pro lie Asp Asn Asp Asn Thr Ser Tyr 

145 150 155 160 

cgc ctg cgc age tgc aac aca teg ate ate acc cag gcc tgc ccc aag 528 

Arg Leu Arg Ser Cys Asn Thr Ser lie lie Thr Gin Ala Cys Pro Lys 

165 170 175 

gtg age ttc gag ccc ate ccc ate cac ttc tgc gcc ccc gcc ggc ttc 576 

Val Ser Phe Glu Pro lie Pro lie His Phe Cys Ala Pro Ala Gly Phe 

180 185 190 

gcc ate ctg aag tgc aac aac aag acc ttc aac ggc acc ggc ccc tgc 624 

Ala lie Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys 

195 200 205 

acc aac gtg age acc gtg cag tgc acc cac gga att cgc ccc gtg gtg 672 

Thr Asn Val Ser Thr Val Gin Cys Thr His Gly lie Arg Pro Val Val 

210 215 220 

age acc cag ctg ctg ctg aac ggc age ctg gcc gag gag gag gtg gtg 720 

Ser Thr Gin Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val 

225 230 235 240 
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ate aga tct gag aac ttc acc aac aac gec aag acc ate ate gtg cag 7 68 

lie Arg Ser Glu Asn Phe Thr Asn Asn Ala Lys Thr lie lie Val Gin 
245 250 255 

ctg aac gag age gtg gag ate aac tgc acc cgc ccc aac aac aac acc 816 
Leu Asn Glu Ser Val Glu lie Asn Cys Thr Arg Pro Asn Asn Asn Thr 
260 265 270 

cgc aag age ate cac ate ggc cct ggc cgc gee ttc tac acc acc ggc 864 
Arg Lys Ser lie His He Gly Pro Gly Arg Ala Phe Tyr Thr Thr Gly 
275 280 285 

tfa-c ate ate ggc gac ate cgc cag gcrrxa'c tgc aac ate tct aga acc 912 
Asp He He Gly Asp He Arg Gin Ala His Cys Asn He Ser Arg Thr 
290 295 300 

aac tgg acc aac acc ctg aag cgc gtg gec gag aag ctg cgc gag aag 960 
Asn Trp Thr Asn Thr Leu Lys Arg Val Ala Glu Lys Leu Arg Glu Lys 
305 310 315 320 

ttc aac aac acc acc ate gtg ttc aac cag age tec ggc ggc gac ccc 1008 
Phe Asn Asn Thr Thr He Val Phe Asn Gin Ser Ser Gly Gly Asp Pro 
325 330 335 

gag ate gtg atg cac age ttc aac tgc ggc ggc gag ttc ttc tac tgc 1056 
Glu He Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys 
340 345 350 

aac acc acc cag ctg ttc aac age acc tgg aac gag acc aac age gag 1104 
Asn Thr Thr Gin Leu Phe Asn Ser Thr Trp Asn Glu Thr Asn Ser Glu 
355 360 365 

ggc aac ate act agt ggc acc ate acc ctg ccc tgc cgc ate aag cag 1152 
Gly Asn lie Thr Ser Gly Thr lie Thr Leu Pro Cys Arg He Lys Gin 
370 375 380 

ate ate aac atg tgg cag gag gtg ggc aag gee atg tac gee ccc ccc 1200 
He He Asn Met Trp Gin Glu Val Gly Lys Ala Met Tyr Ala Pro Pro 
385 390 395 400 

ate ggc ggc cag ate aag tgc ctg age aac ate acc ggc ctg ctg ctg 1248 
He Gly Gly Gin He Lys Cys Leu Ser Asn He Thr Gly Leu Leu Leu 
4 05 " 410 415 

acc cgc gac ggc ggc age gac aac teg age age ggc aag gag att ttc 1296 
Thr Arg Asp Gly Gly Ser Asp Asn Ser Ser Ser Gly Lys Glu He Phe 
420 425 430 

cgc ccc ggc ggc ggc gac atg cgc gac aac tgg cgc age gag ctg tac 134 4 

Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr 
435 440 445 

aag tac aag gtg gtg aag ate gag ccc ctg ggc ate gec ccc acc aag 1392 
Lys Tyr Lys Val Val Lys He Glu Pro Leu Gly He Ala Pro Thr Lys 
450 455 4 60 



gee aag cgc cgc gtg gtg cag cgc gag aag cgc gec gtg ggc ate ggc 
Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly He Gly 
4 65 470 475 480 



1440 
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get atg ttc etc ggc ttc ctg ggc get gca ggc age acc atg ggc gee 1488 

Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly Ala 

485 490 495 

gee age ctg acc ctg acc gtg cag gec cgc cag ctg ctg age ggc ate 1536 

Ala Ser Leu Thr Leu Thr Val Gin Ala Arg Gin Leu Leu Ser Gly lie 

500 505 510 



gtg cag cag cag aac aac ctg ctg cgc gee ate gag gee cag cag cac 1584 

Val Gin Gin Gin Asn Asn Leu Leu Arg Ala lie Glu Ala Gin Gin His 

515 520 525 

ctg etc cag ctg acc gtg tgg ggc ate aag cag etc cag gee cgc gtg 1632 

Leu Leu Gin Leu Thr Val Trp Gly lie Lys Gin Leu Gin Ala Arg Val 

530 535 540 



ctg get eta 
Leu Ala Leu 
545 

ggc tgc tec 
Gly Cys Ser 



age tgg age 
Ser Trp Ser 



atg gag tgg 
Met Glu Trp 
595 

ctg ate gag 
Leu lie Glu 
610 

etc cag ctg 
Leu Gin Leu 
625 



gag cgc tac 
Glu Arg Tyr 
550 

ggc aag ctg 
Gly Lys Leu 
565 

aac aag aac 
Asn Lys Asn 
580 

gag cgc gag 
Glu Arg Glu 



gag age cag 
Glu Ser Gin 



gac aag tgg 
Asp Lys Trp 
630 



etc cag gac 
Leu Gin Asp 



ate tgc acc 
He Cys Thr 



ctg age cag 
Leu Ser Gin 
585 

ate age aac 
He Ser Asn 
600 

aac cag cag 
Asn Gin Gin 
615 

gca age ttg 
Ala Ser Leu 



cag cgc ttc 
Gin Arg Phe 
555 

acg gec gtg 
Thr Ala Val 
570 

att tgg gac 
He Trp Asp 



tac acc gag 
Tyr Thr Glu 



gag aag aac 
Glu Lys Asn 
620 

tgt gac tga 
Cys Asp * 
635 



ctg ggc atg 
Leu Gly Met 



ccc tgg aac 
Pro Trp Asn 
575 

aac atg acc 
Asn Met Thr 
590 

ate ate tac 
He He Tyr 
605 

gag ctg gac 
Glu Leu Asp 



ttg agg ate 
Leu Arg He 



tgg 1680 

Trp 

560 

gec 1728 
Ala 



tgg 1776 
Trp 



age 1824 
Ser 



ctg 1872 
Leu 



c 1918 



<210> 68 
<211> 638 
<212> PRT 

<213> Artificial Sequence 



<400> 68 

Ala Ser Ala Ala Asp Arg Leu Trp Val Thr Val Tyr Tyr Gly Val Pro 

15 10 15 

Val Trp Lys Asp Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys 

20 25 30 

Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val 

35 40 45 

Pro Thr Asp Pro Asn Pro Gin Glu Val Val Leu Gly Asn Val Thr Glu 

50 55 60 

Asn Phe Asn Met Gly Lys Asn Asn Met Val Glu Gin Met His Glu Asp 
65 70 75 80 

He He Ser Leu Trp Asp Gin Ser Leu Lys Pro Cys Val Lys Leu Thr 
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Pro Leu Cys Val 
100 

Thr Asn Asn Thr 
115 

Asn lie Ser Thr 
130 

Phe Tyr Ser Leu 
14 5 

Arg Leu Arg Ser 

Val Ser Phe Glu 
180 

Ala lie Leu Lys 
195 

Thr Asn Val Ser 
210 

Ser Thr Gin Leu 
225 

lie Arg Ser Glu 

Leu Asn Glu Ser 
260 

Arg Lys Ser lie 
275 

Asp lie lie Gly 
290 

Asn Trp Thr Asn 
305 

Phe Asn Asn Thr 

Glu lie Val Met 
340 

Asn Thr Thr Gin 
355 

Gly Asn lie Thr 
370 

lie lie Asn Met 
385 

lie Gly Gly Gin 

Thr Arg Asp Gly 
420 

Arg Pro Gly Gly 
435 

Lys Tyr Lys Val 
450 

Ala Lys Arg Arg 
465 

Ala Met Phe Leu 

Ala Ser Leu Thr 
500 

Val Gin Gin Gin 
515 

Leu Leu Gin Leu 
530 

Leu Ala Leu Glu 
545 



85 

Thr Leu Asn Cys 

Arg Trp Gly Thr 
120 

Ser Val Arg Asn 
135 

Asp lie Val Pro 
150 

Cys Asn Thr Ser 
165 

Pro lie Pro lie 

Cys Asn Asn Lys 
200 

Thr Val Gin Cys 
215 

Leu Leu Asn Gly 
230 

Asn Phe Thr Asn 
245 

Val Glu lie Asn 

His lie Gly Pro 
280 

Asp lie Arg Gin 
295 

Thr Leu Lys Arg 
310 

Thr lie Val Phe 
325 

His Ser Phe Asn 

Leu Phe Asn Ser 
360 

Ser Gly Thr lie 
375 

Trp Gin Glu Val 
390 

lie Lys Cys Leu 
4 05 

Gly Ser Asp Asn 

Gly Asp Met Arg 
440 

Val Lys lie Glu 
455 

Val Val Gin Arg 
470 

Gly Phe Leu Gly 
485 

Leu Thr Val Gin 

Asn Asn Leu Leu 
520 

Thr Val Trp Gly 

535 

Arg Tyr Leu Gin 
550 



90 

Thr Lys Leu Lys 
105 

Gin Glu Met Lys 

Lys Met Lys Arg 
140 

lie Asp Asn Asp 
15 5 

He He Thr Gin 
170 

His Phe Cys Ala 
18"5~~ 

Thr Phe Asn Gly 

Thr His Gly He 
220 

Ser Leu Ala Glu 
235 

Asn Ala Lys Thr 
250 

Cys Thr Arg Pro 
265 

Gly Arg Ala Phe 

Ala His Cys Asn 
300 

Val Ala Glu Lys 
315 

Asn Gin Ser Ser 
330 

Cys Gly Gly Glu 
345 

Thr Trp Asn Glu 

Thr Leu Pro Cys 
380 

Gly Lys Ala Met 
395 

Ser Asn He Thr 
410 

Ser Ser Ser Gly 
425 

Asp Asn Trp Arg 

Pro Leu Gly He 
4 60 

Glu Lys Arg Ala 
475 

Ala Ala Gly Ser 
490 

Ala Arg Gin Leu 
505 

Arg Ala He Glu 

He Lys Gin Leu 
540 

Asp Gin Arg Phe 
555 



95 

Asn Ser Thr Asp 
110 

Asn Cys Ser Phe 
125 

Glu Tyr Ala Leu 

Asn Thr Ser Tyr 
160 

Ala Cys Pro Lys 
175 

Pro Ala Gly Phe 
190 

Thr Gly Pro Cys 
205 

Arg Pro Val Val 

Glu Glu Val Val 
240 

He He Val Gin 
255 

Asn Asn Asn Thr 
270 

Tyr Thr Thr Gly 
285 

He Ser Arg Thr 

Leu Arg Glu Lys 
320 

Gly Gly Asp Pro 
335 

Phe Phe Tyr Cys 
350 

Thr Asn Ser Glu 

365 

Arg He Lys Gin 

Tyr Ala Pro Pro 
4 00 

Gly Leu Leu Leu 
415 

Lys Glu He Phe 
4 30 

Ser Glu Leu Tyr 
445 

Ala Pro Thr Lys 

Val Gly He Gly 
480 

Thr Met Gly Ala 
495 

Leu Ser Gly He 
510 

Ala Gin Gin His 
525 

Gin Ala Arg Val 

Leu Gly Met Trp 

560 
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Gly Cys Ser Gly Lys Leu lie Cys Thr Thr Ala Val Pro Trp Asn Ala 

565 570 575 

Ser Trp Ser Asn Lys Asn Leu Ser Gin lie Trp Asp Asn Met Thr Trp 

580 585 590 

Met Glu Trp Glu Arg Glu lie Ser Asn Tyr Thr Glu lie lie Tyr Ser 

595 600 605 

Leu lie Glu Glu Ser Gin Asn Gin Gin Glu Lys Asn Glu Leu Asp Leu 

610 615 620 

Leu Gin Leu Asp Lys Trp Ala Ser Leu Cys Asp Leu Arg lie 
625 630 635 

<210> 69 
<211> 2071 
<212> DRA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (2071) 

<400> 69 

get age gcg gcc gac cgc ctg tgg gtg acc gtg tac tac ggc gtg ccc 4 8 

Ala Ser Ala Ala Asp Arg Leu Trp Val Thr Val Tyr Tyr Gly Val Pro 
15 10 15 

gtg tgg aag gac gcc acc acc acc ctg ttc tgc gcc age gac gcc aag 96 
Val Trp Lys Asp Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys 
20 25 30 

gcc tac gac acc gag gtg cac aac gtg tgg gcc acc cac gcg tgc gtg 14 4 

Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val 
35 40 45 

ccc acc gac ccc aac ccc cag gag gtg gtg ctg ggc aac gtg acc gag 192 
Pro Thr Asp Pro Asn Pro Gin Glu Val Val Leu Gly Asn Val Thr Glu 
50 55 60 

aac ttc aac atg ggc aag aac aac atg gtg gag cag atg cac gag gat 240 
Asn Phe Asn Met Gly Lys Asn Asn Met Val Glu Gin Met His Glu Asp 
65 70 75 80 

ate ate age ctg tgg gac cag age ctg aag ccc tgc gtg aag ctg acc 288 
lie lie Ser Leu Trp Asp Gin Ser Leu Lys Pro Cys Val Lys Leu Thr 
85 90 95 

ccc ctg tgc gtg acc ctg aac tgc acc aag ctg aag aac age acc gac 336 
Pro Leu Cys Val Thr Leu Asn Cys Thr Lys Leu Lys Asn Ser Thr Asp 
100 105 110 

acc aac aac acc cgc tgg ggc acc cag gag atg aag aac tgc age ttc 384 
Thr Asn Asn Thr Arg Trp Gly Thr Gin Glu Met Lys Asn Cys Ser Phe 
115 120 125 

aac ate age acc age gtg cgc aac aag atg aag cgc gag tac gcc ctg 432 
Asn lie Ser Thr Ser Val Arg Asn Lys Met Lys Arg Glu Tyr Ala Leu 
130 135 140 



ttc tac age ctg gac ate gtg ccc ate gac aac gac aac acc age tac 480 
Phe Tyr Ser Leu Asp He Val Pro He Asp Asn Asp Asn Thr Ser Tyr 
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145 150 155 160 

cgc ctg cgc age tgc aac aca teg ate ate ace cag gee tgc ccc aag 528 
Arg Leu Arg Ser Cys Asn Thr Ser lie lie Thr Gin Ala Cys Pro Lys 
165 170 ^ 175 

gtg age ttc gag ccc ate ccc ate cac ttc tgc gee ccc gec ggc ttc 576 
Val Ser Phe Glu Pro lie Pro He His Phe Cys Ala Pro Ala Gly Phe 
180 185 190 

gec ate ctg aag tgc aac aac aag ace ttc aac ggc ace ggc ccc tgc 624 
Ala He Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys 
195 200 205 

acc aac gtg age acc gtg cag tgc ace cac gga att cgc ccc gtg gtg 672 
Thr Asn Val Ser Thr Val Gin Cys Thr His Gly He Arg Pro Val Val 
210 215 220 

age acc cag ctg ctg ctg aac ggc age ctg gec gag gag gag gtg gtg 720 
Ser Thr Gin Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val 
225 230 235 240 

ate aga tct gag aac ttc acc aac aac gec aag acc ate ate gtg cag 768 
He Arg Ser Glu Asn Phe Thr Asn Asn Ala Lys Thr He He Val Gin 
245 250 ~ 255 

ctg aac gag age gtg gag ate aac tgc acc cgc ccc aac aac aac acc 816 
Leu Asn Glu Ser Val Glu He Asn Cys Thr Arg Pro Asn Asn Asn Thr 
260 265 270 

cgc aag age ate cac ate ggc cct ggc cgc gee ttc tac acc acc ggc 864 
Arg Lys Ser He His He Gly Pro Gly Arg Ala Phe Tyr Thr Thr Gly 
275 280 285 

gac ate ate ggc gac ate cgc cag gee cac tgc aac ate tct aga acc 912 
Asp He He Gly Asp He Arg Gin Ala His Cys Asn He Ser Arg Thr 
290 295 300 

aac tgg acc aac acc ctg aag cgc gtg gee gag aag ctg cgc gag aag 960 
Asn Trp Thr Asn Thr Leu Lys Arg Val Ala Glu Lys Leu Arg Glu Lys 
305 310 315 320 

ttc aac aac acc acc ate gtg ttc aac cag age tec ggc ggc gac ccc 1008 
Phe Asn Asn Thr Thr He Val Phe Asn Gin Ser Ser Gly Gly Asp Pro 
325 330 335 

gag ate gtg atg cac age ttc aac tgc ggc ggc gag ttc ttc tac tgc 1056 
Glu He Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys 
340 345 " 350 

aac acc acc cag ctg ttc aac age acc tgg aac gag acc aac age gag 1104 
Asn Thr Thr Gin Leu Phe Asn Ser Thr Trp Asn Glu Thr Asn Ser Glu 
355 360 365 

ggc aac ate act agt ggc acc ate acc ctg ccc tgc cgc ate aag cag 1152 
Gly Asn lie Thr Ser Gly Thr lie Thr Leu Pro Cys Arg He Lys Gin 
370 375 380 



ate ate aac atg tgg cag gag gtg ggc aag gee atg tac gee ccc ccc 



1200 
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lie lie Asn Met Trp Gin Glu Val Gly Lys Ala Met Tyr Ala Pro Pro 
385 390 395 400 

ate ggc ggc cag ate aag tgc ctg age aac ate ace ggc ctg ctg ctg 1248 
lie Gly Gly Gin lie Lys Cys Leu Ser Asn lie Thr Gly Leu Leu Leu 
405 410 415 

ace cgc gac ggc ggc age gac aac teg age age ggc aag gag att ttc 1296 
Thr Arg Asp Gly Gly Ser Asp Asn Ser Ser Ser Gly Lys Glu lie Phe 
420 425 430 

cgc ccc ggc ggc ggc gac atg cgc gac aac tgg cgc age gag ctg tac 1344 
Arg Pro Gly Gly Gly Asp Met Arg Asp - "Ash Trp Arg Ser Glu Leu Tyr 
435 440 445 

aag tac aag gtg gtg aag ate gag ccc ctg ggc ate gec ccc acc aag 1392 
Lys Tyr Lys Val Val Lys lie Glu Pro Leu Gly lie Ala Pro Thr Lys 
4 50 4 55 4 60 

gee aag cgc cgc gtg gtg cag cgc gag aag cgc gee gtg ggc ate ggc 1440 
Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly lie Gly 
465 470 475 480 

get atg ttc etc ggc ttc ctg ggc get gca ggc age acc atg ggc gee 1488 
Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly Ala 
485 490 495 

gee age ctg acc ctg acc gtg cag gec cgc cag ctg ctg age ggc ate 1536 
Ala Ser Leu Thr Leu Thr Val Gin Ala Arg Gin Leu Leu Ser Gly lie 
500 505 510 

gtg cag cag cag aac aac ctg ctg cgc gee ate gag gec cag cag cac 1584 
Val Gin Gin Gin Asn Asn Leu Leu Arg Ala lie Glu Ala Gin Gin His 
515 520 525 

ctg etc cag ctg acc gtg tgg ggc ate aag cag etc cag gec cgc gtg 1632 
Leu Leu Gin Leu Thr Val Trp Gly lie Lys Gin Leu Gin Ala Arg Val 
530 535 540 

ctg get eta gag cgc tac etc cag gac cag cgc ttc ctg ggc atg tgg 1680 
Leu Ala Leu Glu Arg Tyr Leu Gin Asp Gin Arg Phe Leu Gly Met Trp 
545 550 • 555 560 

ggc tgc tec ggc aag ctg ate tgc acc acg gec gtg ccc tgg aac gee 1728 
Gly Cys Ser Gly Lys Leu lie Cys Thr Thr Ala Val Pro Trp Asn Ala 
565 570 ~ 575 

age tgg age aac aag aac ctg age cag att tgg gac aac atg acc tgg 1776 
Ser Trp Ser Asn Lys Asn Leu Ser Gin lie Trp Asp Asn Met Thr Trp 
580 585 590 

atg gag tgg gag cgc gag ate age aac tac acc gag ate ate tac age 1824 
Met Glu Trp Glu Arg Glu lie Ser Asn Tyr Thr Glu lie lie Tyr Ser 
595 600 605 

ctg ate gag gag age cag aac cag cag gag aag aac gag ctg gac ctg 1872 
Leu He Glu Glu Ser Gin Asn Gin Gin Glu Lys Asn Glu Leu Asp Leu 
610 615 ~ 620 
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etc cag ctg gac aag tgg gca age ttg tgg aac tgg ttc aac ate ace 1920 

Leu Gin Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe Asn lie Thr 

625 630 635 640 

aac tgg ctg tgg tac ate aag att ttc ate atg ate gtg ggc ggc ctg 1968 

Asn Trp Leu Trp Tyr He Lys He Phe He Met He Val Gly Gly Leu 

645 650 655 

ate ggc ctg cgc ate gtg ttc acc gtg ctg age ate gtg aac cgc gtg 2016 

He Gly Leu Arg He Val Phe Thr Val Leu Ser He Val Asn Arg Val 

660 665 670 

cgc cag gga tgc age ccc ctg age ttc cag acc cgc ctg ccc gtg tga 2064 

Arg Gin Gly Cys Ser Pro Leu Ser Phe Gin Thr Arg Leu Pro Val * 

675 680 685 

egg ate c 2071 
Arg He 



<210> 70 
<211> 689 
<212> PRT 

<213> Artificial Sequence 
<400> 70 

Ala Ser Ala Ala Asp Arg Leu Trp Val Thr Val Tyr Tyr Gly Val Pro 

1 5 10 15 

Val Trp Lys Asp Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys 

20 25 30 

Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val 

35 40 45 

Pro Thr Asp Pro Asn Pro Gin Glu Val Val Leu Gly Asn Val Thr Glu 

50 55 60 

Asn Phe Asn Met Gly Lys Asn Asn Met Val Glu Gin Met His Glu Asp 
65 70 75 80 

He He Ser Leu Trp Asp Gin Ser Leu Lys Pro Cys Val Lys Leu Thr 

85 90 95 

Pro Leu Cys Val Thr Leu Asn Cys Thr Lys Leu Lys Asn Ser Thr Asp 

100 105 ~ 110 

Thr Asn Asn Thr Arg Trp Gly Thr Gin Glu Met Lys Asn Cys Ser Phe 

115 120 125 

Asn He Ser Thr Ser Val Arg Asn Lys Met Lys Arg Glu Tyr Ala Leu 

130 135 140 

Phe Tyr Ser Leu Asp lie Val Pro He Asp Asn Asp Asn Thr Ser Tyr 
145 150 155 160 

Arg Leu Arg Ser Cys Asn Thr Ser He He Thr Gin Ala Cys Pro Lys 

165 170 ~ 175 

Val Ser Phe Glu Pro He Pro He His Phe Cys Ala Pro Ala Gly Phe 

180 185 190 

Ala He Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys 

195 200 205 

Thr Asn Val Ser Thr Val Gin Cys Thr His Gly He Arg Pro Val Val 

210 215 220 

Ser Thr Gin Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val 
225 230 235 240 

He Arg Ser Glu Asn Phe Thr Asn Asn Ala Lys Thr He He Val Gin 
245 250 255 
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Leu 


Asn 


Glu 


Ser 








2 60 


Arg 


Lys 


Ser 


He 






275 




Asp 


He 


He 


Gly 




290 






Asn 


Trp 


Thr 


Asn 


305 








Phe 


Asn 


Asn 


Thr 


Glu 


He 


Val 


Met 








la n 

O ft u 


Ash 


Thr 


Thr 


Gin 






355 




Gly 


Asn 


He 


Thr 




370 






lie 


He 


Asn 


Met 


385 








He 


Gly 


Gly 


Gin 


Thr 


Arg 


Asp 


Gly 








a o r\ 
4 2 0 


Arg 


Pro 


Gly 


Gly 






435 




Lys 


Tyr 


Lys 


Val 




450 






Ala 


Lys 


Arg 


Arg 


465 








Ala 


Met 


Phe 


Leu 


Ala 


Ser 


Leu 


Thr 








jUU 


Val 


Gin 


Gin 


Gin 






515 




Leu 


Leu 


Gin 


Leu 




530 






Leu 


Ala 


Leu 


Glu 


545 








Gly 


Cys 


Ser 


Gly 


Ser 


Trp 


Ser 


Asn 








s ft n 


Met 


Glu 


Trp 


Glu 






595 




Leu 


He 


Glu 


Glu 




610 






Leu 


Gin 


Leu 


Asp 


625 








Asn 


Trp 


Leu 


Trp 


He 


Gly 


Leu 


Arg 








660 


Arg 


Gin 


Gly 


Cys 






67 5 





Val 


Glu 


He 


Asn 


His 


He 


Gly 


Pro 








28 0 


Asp 


He 


Arg 


Gin 






295 




Thr 


Leu 


Lys 


Arg 




310 






Thr 


He 


Val 


Phe 


325 








His 


Ser 


Phe 


Asn 


Leu 


Phe 


Asn 


Ser 








*3 C C\ 


Ser 


Gly 


Thr 


He 






375 




Trp 


Gin 


Glu 


Val 




390 






He 


Lys 


Cys 


Leu 


405 








Gly 


Ser 


Asp 


Asn 


Gly 


Asp 


Met 


Arg 








440 


Val 


Lys 


He 


Glu 






455 




Val 


Val 


Gin 


Arg 




470 






Gly 


Phe 


Leu 


Gly 


485 








Leu 


Thr 


Val 


Gin 



Asn Asn Leu Leu 
520 

Thr Val Trp Gly 
535 

Arg Tyr Leu Gin 
550 

Lys Leu He Cys 
5 65 

Lys Asn Leu Ser 

Arg Glu He Ser 
600 

Ser Gin Asn Gin 
615 

Lys Trp Ala Ser 
630 

Tyr He Lys He 
645 

He Val Phe Thr 

Ser Pro Leu Ser 
680 



Cys 


Thr 


Arg 


Pro 


265 








Gly 


Arg 


Ala 


Phe 


Ala 


His 


Cys 


Asn 








300 


Val 


Ala 


Glu 


Lys 






315 




Asn 


Gin 


Ser 


Ser 




330 






Cys 


Gly 


Gly 


Glu 


345 








Thr Trp 


Asn 


Glu 


Thr 


Leu 


Pro 


Cys 








380 


Gly 


Lys 


Ala 


Met 






395 




Ser 


Asn 


He 


Thr 




4 1 U 






Ser 


Ser 


Ser 


Gly 


425 








Asp 


Asn 


Trp 


Arg 


Pro 


Leu 


Gly 


He 








460 


Glu 


Lys 


Arg 


Ala 






475 




7\ "1 _ 

Ala 


Ala 


Gly 


Ser 




4 90 






Ala 


Arg 


Gin 


Leu 


505 








Arg 


Ala 


He 


Glu 


He 


Lys 


Gin 


Leu 








54 0 


Asp 


Gin 


Arg 


Phe 






555 




Thr 


Thr 


Ala 


Val 




570 






Gin 


He 


Trp 


Asp 


585 








Asn 


Tyr 


Thr 


Glu 


Gin 


Glu 


Lys 


Asn 








620 


Leu 


Trp 


Asn 


Trp 






635 




Phe 


He 


Met 


He 




650 






Val 


Leu 


Ser 


He 


665 








Phe 


Gin 


Thr 


Arg 



Asn Asn Asn Thr 
270 

Tyr Thr Thr Gly 
285 

He Ser Arg Thr 

Leu Arg Glu Lys 
320 

Gly Gly Asp Pro 
335 

Phe Phe Tyr Cys 
350 

Thr Asn Ser Glu 
365 

Arg He Lys Gin 

Tyr Ala Pro Pro 
400 

Gly Leu Leu Leu 
415 

Lys Glu He Phe 
430 

Ser Glu Leu Tyr 
4 45 

Ala Pro Thr Lys 

Val Gly He Gly 
480 

Thr Met Gly Ala 
495 

Leu Ser Gly lie 
510 

Ala Gin Gin His 
525 

Gin Ala Arg Val 

Leu Gly Met Trp 
560 

Pro Trp Asn Ala 
575 

Asn Met Thr Trp 
590 

He He Tyr Ser 
605 

Glu Leu Asp Leu 

Phe Asn He Thr 
640 

Val Gly Gly Leu 
655 

Val Asn Arg Val 
670 

Leu Pro Val Arg 
685 



<210> 71 
<211> 2469 
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<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (2469) 

<400> 71 

get age gcg gcc gac cgc ctg tgg gtg acc gtg tac tac ggc gtg ccc 48 

Ala Ser Ala Ala Asp Arg Leu Trp Val Thr Val Tyr Tyr Gly Val Pro 

1 5 10 15 

TJtg tgg aag gac gcc acc acc acc ctrg^ttc tgc gcc age gac gcc aag 96 

Val Trp Lys Asp Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys 

20 25 30 

gcc tac gac acc gag gtg cac aac gtg tgg gcc acc cac gcg tgc gtg 144 

Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val 
35 4 0 4 5 

ccc acc gac ccc aac ccc cag gag gtg gtg ctg ggc aac gtg acc gag 192 

Pro Thr Asp Pro Asn Pro Gin Glu Val Val Leu Gly Asn Val Thr Glu 
50 55 60 

aac ttc aac atg ggc aag aac aac atg gtg gag cag atg cac gag gat 240 

Asn Phe Asn Met Gly Lys Asn Asn Met Val Glu Gin Met His Glu Asp 

65 70 75 80 

ate ate age ctg tgg gac cag age ctg aag ccc tgc gtg aag ctg acc 288 

lie lie Ser Leu Trp Asp Gin Ser Leu Lys Pro Cys Val Lys Leu Thr 

85 90 " " 95 

ccc ctg tgc gtg acc ctg aac tgc acc aag ctg aag aac age acc gac 336 

Pro Leu Cys Val Thr Leu Asn Cys Thr Lys Leu Lys Asn Ser Thr Asp 

100 105 110 

acc aac aac acc cgc tgg ggc acc cag gag atg aag aac tgc age ttc 384 

Thr Asn Asn Thr Arg Trp Gly Thr Gin Glu Met Lys Asn Cys Ser Phe 
115 ' 120 125 

aac ate age acc age gtg cgc aac aag atg aag cgc gag tac gcc ctg 4 32 

Asn lie Ser Thr Ser Val Arg Asn Lys Met Lys Arg Glu Tyr Ala Leu 
130 135 140 

ttc tac age ctg gac ate gtg ccc ate gac aac gac aac acc age tac 480 

Phe Tyr Ser Leu Asp lie Val Pro lie Asp Asn Asp Asn Thr Ser Tyr 

145 150 155 160 

cgc ctg cgc age tgc aac aca teg ate ate acc cag gcc tgc ccc aag 528 

Arg Leu Arg Ser Cys Asn Thr Ser lie lie Thr Gin Ala Cys Pro Lys 

165 170 " 175 

gtg age ttc gag ccc ate ccc ate cac ttc tgc gcc ccc gcc ggc ttc 576 

Val Ser Phe Glu Pro lie Pro lie His Phe Cys Ala Pro Ala Gly Phe 

180 185 190 

gcc ate ctg aag tgc aac aac aag acc ttc aac ggc acc ggc ccc tgc 624 

Ala lie Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys 
195 200 205 
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acc aac gtg 
Thr Asn Val 
210 

age acc cag 
Ser Thr Gin 
225 

ate aga tct 
lie Arg Ser 



ctg aac gag 
Leu Asn Glu 



cgc aag age 
Arg Lys Ser 
27 5 

gac ate ate 
Asp lie lie 
290 

aac tgg acc 
Asn Trp Thr 
305 

ttc aac aac 
Phe Asn Asn 



gag ate gtg 
Glu lie Val 



aac acc acc 
Asn Thr Thr 
355 

ggc aac ate 
Gly Asn lie 
370 

ate ate aac 
lie lie Asn 
385 

ate ggc ggc 
lie Gly Gly 



acc cgc gac 
Thr Arg Asp 



cgc ccc ggc 
Arg Pro Gly 



age acc gtg 
Ser Thr Val 



ctg ctg ctg 
Leu Leu Leu 
230 

gag aac ttc 
Glu Asn Phe 
245 

age gtg gag 
Ser Val Glu 
260 

ate cac ate 
lie His He 



ggc gac ate 
Gly Asp lie 



aac acc ctg 
Asn Thr Leu 
310 

acc acc ate 
Thr Thr He 
325 

atg cac age 
Met His Ser 
340 

cag ctg ttc 
Gin Leu Phe 



act agt ggc 
Thr Ser Gly 



atg tgg cag 
Met Trp Gin 
390 

cag ate aag 
Gin He Lys 
405 

ggc ggc age 
Gly Gly Ser 
420 

ggc ggc gac 
Gly Gly Asp 



47 



cag tgc acc 
Gin Cys Thr 
215 

aac ggc age 
Asn Gly Ser 



acc aac aac 
Thr Asn Asn 



ate aac tgc 
He Asn Cys 
265 

ggc cct ggc 
Gly Pro Gly 
280 

cgc cag gec 
Arg Gin Ala 
295 

aag cgc gtg 
Lys Arg Val 



gtg ttc aac 
Val Phe Asn 



ttc aac tgc 
Phe Asn Cys 
345 

aac age acc 
Asn Ser Thr 
360 

acc ate acc 
Thr He Thr 
375 

gag gtg ggc 
Glu Val Gly 



tgc ctg age 
Cys Leu Ser 



gac aac teg 
Asp Asn Ser 
425 

atg cgc gac 
Met Arg Asp 



cac gga att 
His Gly He 
220 

ctg gee gag 
Leu Ala Glu 
235 

gee aag acc 
Ala Lys Thr 
250 

acc cgc ccc 
Thr Arg Pro 



cgc gee ttc 
Arg Ala Phe 



cac tgc aac 
His Cys Asn 
300 

gec gag aag 
Ala Glu Lys 
315 

cag age tec 
Gin Ser Ser 
330 

ggc ggc gag 
Gly Gly Glu 



tgg aac gag 
Trp Asn Glu 



ctg ccc tgc 
Leu Pro Cys 
380 

aag gee atg 
Lys Ala Met 
395 

aac ate acc 
Asn He Thr 
410 

age age ggc 
Ser Ser Gly 



aac tgg cgc 
Asn Trp Arg 



cgc ccc gtg 
Arg Pro Val 



gag gag gtg 
Glu Glu Val 



ate ate gtg 
He He Val 

255 

aac aac aac 
Asn Asn Asn 
270 

tac acc acc 
Tyr Thr Thr 
285 

ate tct aga 
He Ser Arg 



ctg cgc gag 
Leu Arg Glu 



ggc ggc gac 
Gly Gly Asp 
335 

ttc ttc tac 
Phe Phe Tyr 
350 

acc aac age 
Thr Asn Ser 
365 

cgc ate aag 
Arg He Lys 



tac gec ccc 
Tyr Ala Pro 



ggc ctg ctg 
Gly Leu Leu 
415 

aag gag att 
Lys Glu He 
430 

age gag ctg 
Ser Glu Leu 



gtg 672 
Val 



gtg 720 

Val 

240 

cag 768 
Gin 



acc 816 
Thr 



ggc 864 
Gly 



acc 912 
Thr 



aag 960 

Lys 

320 

ccc 1008 
Pro 



tgc 1056 
Cys 



gag 1104 
Glu 



cag 1152 
Gin 



ccc 1200 

Pro 

400 

ctg 1248 
Leu 



ttc 1296 
Phe 



tac 1344 
Tyr 
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435 440 445 

aag tac aag gtg gtg aag ate gag ccc ctg ggc ate gee ccc acc aag 1392 
Lys Tyr Lys Val Val Lys lie Glu Pro Leu Gly lie Ala Pro Thr Lys 
450 455 460 

gee aag cgc cgc gtg gtg cag cgc gag aag cgc gec gtg ggc ate ggc 14 40 

Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly lie Gly 
465 470 475 480 

get atg ttc etc ggc ttc ctg ggc get gca ggc age acc atg ggc gee 1488 
Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly Ala 
485 49-0 495 

gee age ctg acc ctg acc gtg cag gec cgc cag ctg ctg age ggc ate 1536 
Ala Ser Leu Thr Leu Thr Val Gin Ala Arg Gin Leu Leu Ser Gly lie 
500 505 510 

gtg cag cag cag aac aac ctg ctg cgc gec ate gag gec cag cag cac 1584 
Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He Glu Ala Gin Gin His 
515 520 525 

ctg etc cag ctg acc gtg tgg ggc ate aag cag etc cag gee cgc gtg 1632 
Leu Leu Gin Leu Thr Val Trp Gly He Lys Gin Leu Gin Ala Arg Val 
530 535 ~ 540 

ctg get eta gag cgc tac etc cag gac cag cgc ttc ctg ggc atg tgg 1680 
Leu Ala Leu Glu Arg Tyr Leu Gin Asp Gin Arg Phe Leu Gly Met Trp 
545 550 555 560 

ggc tgc tec ggc aag ctg ate tgc acc acg gee gtg ccc tgg aac gec 1728 
Gly Cys Ser Gly Lys Leu He Cys Thr Thr Ala Val Pro Trp Asn Ala 
565 570 ' 575 

age tgg age aac aag aac ctg age cag att tgg gac aac atg acc tgg 1776 
Ser Trp Ser Asn Lys Asn Leu Ser Gin He Trp Asp Asn Met Thr Trp 
580 585 590 

atg gag tgg gag cgc gag ate age aac tac acc gag ate ate tac age 1824 
Met Glu Trp Glu Arg Glu He Ser Asn Tyr Thr Glu He He Tyr Ser 
595 600 605 

ctg ate gag gag age cag aac cag cag gag aag aac gag ctg gac ctg 1872 
Leu lie Glu Glu Ser Gin Asn Gin Gin Glu Lys Asn Glu Leu Asp Leu 
610 615 620 

etc cag ctg gac aag tgg gca age ttg tgg aac tgg ttc aac ate acc 1920 
Leu Gin Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe Asn He Thr 
625 630 635 640 

aac tgg ctg tgg tac ate aag att ttc ate atg ate gtg ggc ggc ctg 1968 
Asn Trp Leu Trp Tyr He Lys He Phe He Met He Val Gly Gly Leu 
645 650 "* 655 

ate ggc ctg cgc ate gtg ttc acc gtg ctg age ate gtg aac cgc gtg 2016 
He Gly Leu Arg lie Val Phe Thr Val Leu Ser He Val Asn Arg Val 
660 665 670 



cgc cag ggc tac age ccc ctg age ttc cag acc cgc ctg ccc gtg ccc 



2064 
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Arg Gin Gly Tyr Ser Pro Leu Ser Phe Gin Thr Arg Leu Pro Val Pro 
675 680 685 

cgc ggc ccc gac cgc ccc gag ggc ate gag gag gag ggc ggc gag cgc 2112 
Arg Gly Pro Asp Arg Pro Glu Gly He Glu Glu Glu Gly Gly Glu Arg 
690 695 700 

gac cgc gac cgc age acc cgc ctg gtg ace ggc ttc ctg ccc ctg ate 2160 
Asp Arg Asp Arg Ser Thr Arg Leu Val Thr Gly Phe Leu Pro Leu He 
705 710 715 720 

tgg gac gac ctg cgc age ctg ttc ctg ttc age tac cat cga ttg cgc 2208 
s ¥^ep Asp Asp Leu Arg Ser Leu Phe Letr-Phe Ser Tyr His Arg Leu Arg 
725 730 ' 735 

gac ctg ctg ctg ate gtg gec cgc ate gtg gag ctg ctg ggc egg cgc 2256 
Asp Leu Leu Leu He Val Ala Arg He Val Glu Leu Leu Gly Arg Arg 
740 745 750 

ggc tgg gag ate ctg aag tac tgg tgg aac ctg etc cag tac tgg age 2304 
Gly Trp Glu He Leu Lys Tyr Trp Trp Asn Leu Leu Gin Tyr Trp Ser 
755 760 765 

cag gag ctg aag aac tct gca gtg age ctg ctg aac gec acc gee ate 2352 
Gin Glu Leu Lys Asn Ser Ala Val Ser Leu Leu Asn Ala Thr Ala He 
770 775 780 

gec gtg gec gag ggc acc gac cgc gtg ate gag gtg gtg cag cgc ate 2400 
Ala Val Ala Glu Gly Thr Asp Arg Val He Glu Val Val Gin Arg He 
785 790 795 800 

tgg cgc ggc ate ctg cac ate ccc acc cga att cgc cag ggc ttc gag 2448 
Trp Arg Gly He Leu His He Pro Thr Arg He Arg Gin Gly Phe Glu 
805 810 815 

cgc gec ctg ctg taa gga tec 2469 
Arg Ala Leu Leu * Gly Ser 
820 



<210> 72 
<211> 822 
<212> PRT 

<213> Artificial Sequence 



<400> 72 



Ala 


Ser 


Ala 


Ala 


Asp 


Arg 


Leu 


Trp 


Val 


Thr 


Val 


Tyr 


Tyr 


Gly 


Val 


Pro 


1 








5 










10 










15 




Val 


Trp 


Lys 


Asp 


Ala 


Thr 


Thr 


Thr 


Leu 


Phe 


Cys 


Ala 


Ser 


Asp 


Ala 


Lys 








20 










25 










30 




Ala 


Tyr 


Asp 


Thr 


Glu 


Val 


His 


Asn 


Val 


Trp 


Ala 


Thr 


His 


Ala 


Cys 


Val 






35 










40 










45 






Pro 


Thr 
50 


Asp 


Pro 


Asn 


Pro 


Gin 
55 


Glu 


Val 


Val 


Leu 


Gly 
60 


Asn 


Val 


Thr 


Glu 


Asn 


Phe 


Asn 


Met 


Gly 


Lys 


Asn 


Asn 


Met 


Val 


Glu 


Gin 


Met 


His 


Glu 


Asp 


65 










70 










75 










80 


He 


lie 


Ser 


Leu 


Trp 
85 


Asp 


Gin 


Ser 


Leu 


Lys 
90 


Pro 


Cys 


Val 


Lys 


Leu 
95 


Thr 


Pro 


Leu 


Cys 


Val 


Thr 


Leu 


Asn 


Cys 


Thr 


Lys 


Leu 


Lys 


Asn 


Ser 


Thr 


Asp 
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100 

Thr Asn Asn Thr 
115 

Asn lie Ser Thr 
130 

Phe Tyr Ser Leu 
145 

Arg Leu Arg Ser 

Val Ser Phe Glu 
180 

Ala lie Leu Lys 
195 

Thr Asn Val Ser 
210 

Ser Thr Gin Leu 
225 

lie Arg Ser Glu 

Leu Asn Glu Ser 
260 

Arg Lys Ser lie 
275 

Asp lie lie Gly 
2 90 

Asn Trp Thr Asn 
305 

Phe Asn Asn Thr 

Glu lie Val Met 
340 

Asn Thr Thr Gin 
355 

Gly Asn lie Thr 
370 

lie lie Asn Met 
385 

lie Gly Gly Gin 

Thr Arg Asp Gly 
420 

Arg Pro Gly Gly 
4 35 

Lys Tyr Lys Val 
450 

Ala Lys Arg Arg 
4 65 

Ala Met Phe Leu 

Ala Ser Leu Thr 
500 

Val Gin Gin Gin 
515 

Leu Leu Gin Leu 
530 

Leu Ala Leu Glu 
545 

Gly Cys Ser Gly 



Arg Trp Gly Thr 
120 

Ser Val Arg Asn 
135 

Asp lie Val Pro 
150 

Cys Asn Thr Ser 
165 

Pro lie Pro lie 

Cys Asn Asn Lys 
200 

Thr Val Gin Cys 
215 

Leu Leu Asn Gly 
230 

Asn Phe Thr Asn 
245 

Val Glu lie Asn 

His lie Gly Pro 
280 

Asp lie Arg Gin 
295 

Thr Leu Lys Arg 
310 

Thr He Val Phe 
325 

His Ser Phe Asn 

Leu Phe Asn Ser 
360 

Ser Gly Thr He 
375 

Trp Gin Glu Val 
390 

He Lys Cys Leu 
405 

Gly Ser Asp Asn 

Gly Asp Met Arg 
440 

Val Lys He Glu 
455 

Val Val Gin Arg 
470 

Gly Phe Leu Gly 

4 85 

Leu Thr Val Gin 

Asn Asn Leu Leu 
520 

Thr Val Trp Gly 
535 

Arg Tyr Leu Gin 
550 

Lys Leu He Cys 

5 65 



105 

Gin Glu Met Lys 

Lys Met Lys Arg 
140 

He Asp Asn Asp 
155 

He He Thr Gin 
170 

His Phe Cys Ala 
185 

Thr Phe Asn Gly 

Thr His Gly He 

220 

Ser Leu Ala Glu 
235 

Asn Ala Lys Thr 
250 

Cys Thr Arg Pro 
2 65 

Gly Arg Ala Phe 

Ala His Cys Asn 
300 

Val Ala Glu Lys 
315 

Asn Gin Ser Ser 
330 

Cys Gly Gly Glu 
345 

Thr Trp Asn Glu 

Thr Leu Pro Cys 
380 

Gly Lys Ala Met 
395 

Ser Asn He Thr 
410 

Ser Ser Ser Gly 
425 

Asp Asn Trp Arg 

Pro Leu Gly He 
460 

Glu Lys Arg Ala 
475 

Ala Ala Gly Ser 
490 

Ala Arg Gin Leu 
505 

Arg Ala He Glu 

He Lys Gin Leu 
540 

Asp Gin Arg Phe 
555 

Thr Thr Ala Val 
570 



110 

Asn Cys Ser Phe 
125 

Glu Tyr Ala Leu 

Asn Thr Ser Tyr 
160 

Ala Cys Pro Lys 
17 5 

Pro Ala Gly Phe 
190 

Thr Gly Pro Cys 
205 

Arg Pro Val Val 

Glu Glu Val Val 
240 

He He Val Gin 
255 

Asn Asn Asn Thr 

270 

Tyr Thr Thr Gly 
28 5 

lie Ser Arg Thr 

Leu Arg Glu Lys 
320 

Gly Gly Asp Pro 
335 

Phe Phe Tyr Cys 
350 

Thr Asn Ser Glu 

365 

Arg He Lys Gin 

Tyr Ala Pro Pro 
400 

Gly Leu Leu Leu 
415 

Lys Glu He Phe 
430 

Ser Glu Leu Tyr 
4 45 

Ala Pro Thr Lys 

Val Gly He Gly 
480 

Thr Met Gly Ala 
495 

Leu Ser Gly He 
510 

Ala Gin Gin His 
525 

Gin Ala Arg Val 

Leu Gly Met Trp 
560 

Pro Trp Asn Ala 
575 
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Ser Trp Ser Asn 
580 

Met Glu Trp Glu 
595 

Leu lie Glu Glu 
610 

Leu Gin Leu Asp 
625 

Asn Trp Leu Trp 

lie Gly Leu Arg 
660 

Jteg Gin Gly Tyr 
675 

Arg Gly Pro Asp 
690 

Asp Arg Asp Arg 
705 

Trp Asp Asp Leu 

Asp Leu Leu Leu 
740 

Gly Trp Glu lie 
755 

Gin Glu Leu Lys 
770 

Ala Val Ala Glu 
785 

Trp Arg Gly lie 

Arg Ala Leu Leu 

820 



Lys Asn Leu Ser 

Arg Glu lie Ser 
600 

Ser Gin Asn Gin 
615 

Lys Trp Ala Ser 
630 

Tyr lie Lys lie 
645 

Tie Val Phe Thr 

Ser Pro Leu Ser 
680 

Arg Pro Glu Gly 
695 

Ser Thr Arg Leu 
710 

Arg Ser Leu Phe 
725 

He Val Ala Arg 

Leu Lys Tyr Trp 
760 

Asn Ser Ala Val 
775 

Gly Thr Asp Arg 
790 

Leu His He Pro 
805 

Gly Ser 



Gin He Trp Asp 
58 5 

Asn Tyr Thr Glu 

Gin Glu Lys Asn 
620 

Leu Trp Asn Trp 
635 

Phe He Met lie 
650 

Val Leu Ser He 
665 

Phe- Gin Thr Arg 

He Glu Glu Glu 

700 

Val Thr Gly Phe 
715 

Leu Phe Ser Tyr 
730 

lie Val Glu Leu 
745 

Trp Asn Leu Leu 

Ser Leu Leu Asn 
780 

Val He Glu Val 
795 

Thr Arg He Arg 
810 



Asn Met Thr Trp 
590 

He He Tyr Ser 
605 

Glu Leu Asp Leu 

Phe Asn lie Thr 
640 

Val Gly Gly Leu 
655 

Val Asn Arg Val 
670 

Leu Pro Val Pro 
685 

Gly Gly Glu Arg 

Leu Pro Leu lie 
720 

His Arg Leu Arg 
735 

Leu Gly Arg Arg 
7 50 

Gin Tyr Trp Ser 
7 65 

Ala Thr Ala He 

Val Gin Arg He 
800 

Gin Gly Phe Glu 
815 



<210> 73 

<211> 1431 

<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (1431) 

<400> 73 

get age gcg gec gac cgc ctg tgg gtg acc gtg tac tac ggc gtg ccc 4 8 

Ala Ser Ala Ala Asp Arg Leu Trp Val Thr Val Tyr Tyr Gly Val Pro 
1 5 10 15 



gtg tgg aag gac gec acc acc acc ctg ttc tgc gec age gac gec aag 96 
Val Trp Lys Asp Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys 
20 25 30 



gec tac gac acc gag gtg cac aac gtg tgg gec acc cac gcg tgc gtg 144 
Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val 
35 40 45 



ccc acc gac ccc aac ccc cag gag gtg gtg ctg ggc aac gtg acc gag 192 
Pro Thr Asp Pro Asn Pro Gin Glu Val Val Leu Gly Asn Val Thr Glu 
50 55 60 
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aac ttc aac atg ggc aag aac aac atg gtg gag cag atg cac gag gat 
Asn Phe Asn Met Gly Lys Asn Asn Met Val Glu Gin Met His Glu Asp 
65 70 75 80 



240 



ate ate age ctg tgg gac cag age ctg aag ccc tgc gtg aag ctg ace 
lie lie Ser Leu Trp Asp Gin Ser Leu Lys Pro Cys Val Lys Leu Thr 
85 90 95 



288 



ccc ctg tgc gtg ace ctg aac tgc acc aag ctg aag aac age acc gac 
Pro Leu Cys Val Thr Leu Asn Cys Thr Lys Leu Lys Asn Ser Thr Asp 
100 105 110 



336 



*&c aac aac acc cgc tgg ggc acc ca-g--gag atg aag aac tgc age ttc 
Thr Asn Asn Thr Arg Trp Gly Thr Gin Glu Met Lys Asn Cys Ser Phe 
115 120 ^ 125 



384 



aac ate age acc age gtg cgc aac aag atg aag cgc gag tac gec ctg 
Asn lie Ser Thr Ser Val Arg Asn Lys Met Lys Arg Glu Tyr Ala Leu 
130 135 140 



432 



ttc tac age ctg gac ate gtg ccc ate gac aac gac aac acc age tac 
Phe Tyr Ser Leu Asp lie Val Pro lie Asp Asn Asp Asn Thr Ser Tyr 
145 150 155 ^ 160 



480 



cgc ctg cgc age tgc aac aca teg ate ate acc cag gee tgc ccc aag 
Arg Leu Arg Ser Cys Asn Thr Ser lie lie Thr Gin Ala Cys Pro Lys 
165 17 0 ^ 175 



528 



gtg age ttc gag ccc ate ccc ate cac ttc tgc gee ccc gec ggc ttc 
Val Ser Phe Glu Pro lie Pro lie His Phe Cys Ala Pro Ala Gly Phe 
180 185 190 



576 



gee ate ctg aag tgc aac aac aag acc ttc aac ggc acc ggc ccc tgc 
Ala lie Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys 
195 200 205 



624 



acc aac gtg age acc gtg cag tgc acc cac gga att cgc ccc gtg gtg 
Thr Asn Val Ser Thr Val Gin Cys Thr His Gly lie Arg Pro Val Val 
210 215 220 



672 



age acc cag ctg ctg ctg aac ggc age ctg gee gag gag gag gtg gtg 
Ser Thr Gin Leu Leu Leu Asn Gly Ser Le.u Ala Glu Glu Glu Val Val 
225 230 235 240 



720 



ate aga tct gag aac ttc acc aac aac gec aag acc ate ate gtg cag 
He Arg Ser Glu Asn Phe Thr Asn Asn Ala Lys Thr He He Val Gin 
245 250 255 



768 



ctg aac gag age gtg gag ate aac tgc acc cgc ccc aac aac aac acc 
Leu Asn Glu Ser Val Glu He Asn Cys Thr Arg Pro Asn Asn Asn Thr 
260 265 270 



816 



cgc aag age ate cac ate ggc cct ggc cgc gec ttc tac acc acc ggc 
Arg Lys Ser He His He Gly Pro Gly Arg Ala Phe Tyr Thr Thr Gly 
275 280 285 



864 



gac ate ate ggc gac ate cgc cag gee cac tgc aac ate tct aga acc 
Asp He He Gly Asp He Arg Gin Ala His Cys Asn He Ser Arg Thr 
290 295 300 



912 
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aac tgg acc 
Asn Trp Thr 
305 

ttc aac aac 
Phe Asn Asn 



gag ate gtg 
Glu lie Val 



aac acc acc 
Asn Thr Thr 
355 

ggc aac ate 
Gly Asn He 
370 

ate ate aac 
He lie Asn 
385 

ate ggc ggc 
He Gly Gly 



acc cgc gac 
Thr Arg Asp 



aac acc ctg 
Asn Thr Leu 
310 

acc acc ate 
Thr Thr He 
325 

atg cac age 
Met His Ser 
340 

cag ctg ttc 
Gin Leu Phe 



act agt ggc 
Thr Ser Gly 



atg tgg cag 
Met Trp Gin 
390 

cag ate aag 
Gin He Lys 
405 

ggc ggc age 
Gly Gly Ser 
420 



aag cgc gtg 
Lys Arg Val 



gtg ttc aac 
Val Phe Asn 



ttc aac tgc 
Phe Asn Cys 
345 

aac age acc 
Asn Ser Thr 
360 

acc ate acc 
Thr He Thr 
375 

gag gtg ggc 
Glu Val Gly 



tgc ctg age 
Cys Leu Ser 



gac aac teg 
Asp Asn Ser 
425 



gee gag aag 
Ala Glu Lys 
315 

cag age tec 
Gin Ser Ser 
330 

ggc ggc gag 
Gly Gly Glu 



tgg aac gag 
Trp Asn Glu 



ctg ccc tgc 
Leu Pro Cys 
380 

aag gec atg 
Lys Ala Met 
395 

aac ate acc 
Asn He Thr 
410 

age age ggc 
Ser Ser Gly 



ctg cgc gag 
Leu Arg Glu 



ggc ggc gac 
Gly Gly Asp 
335 

ttc ttc tac 
Phe Phe Tyr 
350 

acc aac age 
Thr Asn Ser 
365 

cgc ate aag 
Arg He Lys 



tac gee ccc 
Tyr Ala Pro 



ggc ctg ctg 
Gly Leu Leu 
415 

aag gag att 
Lys Glu He 
430 



aag 960 

Lys 

320 

ccc 1008 
Pro 



tgc 1056 
Cys 



gag 1104 
Glu 



cag 1152 
Gin 



ccc 1200 

Pro 

400 

ctg 1248 
Leu 



ttc 1296 
Phe 



cgc ccc ggc ggc ggc gac atg cgc gac aac tgg cgc age gag ctg tac 134 4 

Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr 
435 440 445 

aag tac aag gtg gtg aag ate gag ccc ctg ggc ate gee ccc acc aag 1392 

Lys Tyr Lys Val Val Lys He Glu Pro Leu Gly He Ala Pro Thr Lys 
450 455 460 



gee aag cgc cgc gtg gtg cag cgc gag aa.g cgc gee tag 14 31 

Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala * 
465 470 475 



<210> 74 
<211> 476 
<212> PRT 

<213> Artificial Sequence 

<400> 74 
Ala Ser Ala Ala Asp Arg Leu Trp 

1 5 
Val Trp Lys Asp Ala Thr Thr Thr 
20 

Ala Tyr Asp Thr Glu Val His Asn 

35 40 
Pro Thr Asp Pro Asn Pro Gin Glu 



Val Thr Val Tyr Tyr Gly Val Pro 

10 15 

Leu Phe Cys Ala Ser Asp Ala Lys 
25 30 

Val Trp Ala Thr His Ala Cys Val 
45 

Val Val Leu Gly Asn Val Thr Glu 
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50 










55 










60 










Asn 


Phe 


Asn 


Met 


Gly 


Lys 


Asn 


Asn 


Met 


Val 


Glu 


Gin 


Met 


His 


Glu 


Asp 


65 










70 










75 










80 


lie 


lie 


Ser 


Leu 


Trp 


Asp 


Gin 


Ser 


Leu 


Lys 


Pro 


Cys 


Val 


Lys 


Leu 


Thr 










85 










90 










95 




Pro 


Leu 


Cys 


Val 


Thr 


Leu 


Asn 


Cys 


Thr 


Lys 


Leu 


Lys 


Asn 


Ser 


Thr 


Asp 








100 










105 










110 






Thr 


Asn 


Asn 


Thr 


Arg 


Trp 


Gly 


Thr 


Gin 


Glu 


Met 


Lys 


Asn 


Cys 


Ser 


Phe 






115 










120 










125 








Asn 


lie 


Ser 


Thr 


Ser 


Val 


Arg 


Asn 


Lys 


Met 


Lys 


Arg 


Glu 


Tyr 


Ala 


Leu 




130 










135 










14 0 










Phe 


Tyr 


Ser 


Leu 


Asp 


lie 


Val 


Pro 


He 


Asp 


Asn 


Asp 


Asn 


Thr 


Ser 


Tyr 


-i-4-5 










150 










155 










160 


Arg 


Leu 


Arg 


Ser 


Cys 


Asn 


Thr 


Ser 


He 


lie 


Thr 


Gin 


Ala 


Cys 


Pro 


Lys 










165 










170 










175 




Val 


Ser 


Phe 


Glu 


Pro 


lie 


Pro 


He 


His 


Phe 


Cys 


Ala 


Pro 


Ala 


Gly 


Phe 








180 










185 










190 






Ala 


lie 


Leu 


Lys 


Cys 


Asn 


Asn 


Lys 


Thr 


Phe 


Asn 


Gly 


Thr 


Gly 


Pro 


Cys 






1 95 










200 










205 








Thr 


Asn 


Val 


Ser 


Thr 


Val 


Gin 


Cys 


Thr 


His 


Gly 


He 


Arg 


Pro 


Val 


Val 




OTA 

zlO 










215 










220 










Ser 


Thr 


Gin 


Leu 


Leu 


Leu 


Asn 


Gly 


Ser 


Leu 


Ala 


Glu 


Glu 


Glu 


Val 


Val 


ZZd 










230 










235 










240 


lie 


Arg 


Ser 


Glu 


Asn 


Phe 


Thr 


Asn 


Asn 


Ala 


Lys 


Thr 


He 


He 


Val 


Gin 










24 5 










250 










255 




Leu 


Asn 


Glu 


Ser 


Val 


Glu 


He 


Asn 


Cys 


Thr 


Arg 


Pro 


Asn 


Asn 


Asn 


Thr 








260 










265 










270 






Arg 


Lys 


Ser 


lie 


His 


He 


Gly 


Pro 


Gly Arg 


Ala 


Phe 


Tyr 


Thr 


Thr 


Gly 






275 










280 










285 








Asp 


lie 


lie 


Gly 


Asp 


He 


Arg 


Gin 


Ala 


His 


Cys 


Asn 


He 


Ser 


Arg 


Thr 




290 










295 










300 










Asn 


Trp 


Thr 


Asn 


Thr 


Leu 


Lys 


Arg 


Val 


Ala 


Glu 


Lys 


Leu 


Arg 


Glu 


Lys 


305 










310 










315 










320 


Phe 


Asn 


Asn 


Thr 


Thr 


He 


Val 


Phe 


Asn 


Gin 


Ser 


Ser 


Gly 


Gly 


Asp 


Pro 










325 










330 










335 




Glu 


lie 


Val 


Met 


His 


Ser 


Phe 


Asn 


Cys 


Gly 


Gly 


Glu 


Phe 


Phe 


Tyr 


Cys 








34 0 










345 










350 






Asn 


Thr 


Thr 


Gin 


Leu 


Phe 


Asn 


Ser 


Thr 


Trp 


Asn 


Glu 


Thr 


Asn 


Ser 


Glu 






*3 c: c; 
Job 










360 










365 








Gly 


Asn 


lie 


Thr 


Ser 


Gly 


Thr 


He 


Thr 


Leu 


Pro 


Cys 


Arg 


He 


Lys 


Gin 




o / U 










37 5 










380 










lie 


lie 


Asn 


Met 


Trp 


Gin 


Glu 


Val 


Gly 


Lys 


Ala 


Met 


Tyr 


Ala 


Pro 


Pro 


o o c 
Jo o 










390 










395 










400 


lie 


Gly 


Gly 


Gin 


lie 


Lys 


Cys 


Leu 


Ser 


Asn 


He 


Thr 


Gly 


Leu 


Leu 


Leu 










4 U d 










410 










415 




Thr 


Arg 


Asp 


Gly 


Gly 


Ser 


Asp 


Asn 


Ser 


Ser 


Ser 


Gly 


Lys 


Glu 


lie 


Phe 








420 










425 










430 






Arg 


Pro 


Gly 


Gly 


Gly 


Asp 


Met 


Arg 


Asp 


Asn 


Trp 


Arg 


Ser 


Glu 


Leu 


Tyr 






435 










440 










445 








Lys 


Tyr 


Lys 


Val 


Val 


Lys 


He 


Glu 


Pro 


Leu 


Gly 


He 


Ala 


Pro 


Thr 


Lys 




450 










455 










460 










Ala 


Lys 


Arg 


Arg 


Val 


Val 


Gin 


Arg 


Glu 


Lys 


Arg 


Ala 











465 470 475 



<210> 75 
<211> 1038 
<212> DNA 

<213> Artificial Sequence 
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<220> 
<221> 
<222> 

<400> 
gcc gtg ggc 
Ala Val Gly 
1 

age acc atg 
Ser Thr Met 



ctg ctg age 
Leu Leu Ser 
35 

gag gcc cag 
Glu Ala Gin 
50 

etc cag gcc 
Leu Gin Ala 
65 

ttc ctg ggc 
Phe Leu Gly 



gtg ccc tgg 
Val Pro Trp 



gac aac atg 
Asp Asn Met 
115 

gag ate ate 
Glu He He 
130 

aac gag ctg 
Asn Glu Leu 
145 

tgg ttc aac 
Trp Phe Asn 



ate gtg ggc 
He Val Gly 



ate gtg aac 
He Val Asn 
195 

cgc ctg ccc 
Arg Leu Pro 



CDS 

(1) . . . (1038 
75 

ate ggc get 
He Gly Ala 
5 

ggc gcc gcc 
Gly Ala Ala 
20 

ggc ate gtg 
Gly He Val 



cag cac ctg 
Gin His Leu 



cgc gtg ctg 
Arg Val Leu 
70 

atg tgg ggc 
Met Trp Gly 
85 

aac gcc age 
Asn Ala Ser 
100 

acc tgg atg 
Thr Trp Met 



tac age ctg 
Tyr Ser Leu 



gac ctg etc 
Asp Leu Leu 
150 

ate acc aac 
lie Thr Asn 
165 

ggc ctg ate 
Gly Leu He 
180 

cgc gtg cgc 
Arg Val Arg 



gtg ccc cgc 
Val Pro Arg 



atg ttc etc 
Met Phe Leu 



age ctg acc 
Ser Leu Thr 
25 

cag cag cag 
Gin Gin Gin 
40 

etc cag ctg 
Leu Gin Leu 
55 

get eta gag 
Ala Leu Glu 



tgc tec ggc 
Cys Ser Gly 



tgg age aac 
Trp Ser Asn 
105 

gag tgg gag 
Glu Trp Glu 
120 

ate gag gag 
He Glu Glu 
135 

cag ctg gac 
Gin Leu Asp 



tgg ctg tgg 
Trp Leu Trp 



ggc ctg cgc 
Gly Leu Arg 
185 

cag ggc tac 
Gin Gly Tyr 
200 

ggc ccc gac 
Gly Pro Asp 



ggc ttc ctg 
Gly Phe Leu 
10 

ctg acc gtg 
Leu Thr Val 



aac aac ctg 
Asn Asn Leu 



acc gtg tgg 
Thr Val Trp 
60 

cgc tac etc 
Arg Tyr Leu 
75 

aag ctg ate 
Lys Leu He 
90 

aag aac ctg 
Lys Asn Leu 



cgc gag ate 
Arg Glu He 



age cag aac 
Ser Gin Asn 
140 

aag tgg gca 
Lys Trp Ala 
155 

tac ate aag 
Tyr He Lys 
170 

ate gtg ttc 
He Val Phe 



age ccc ctg 
Ser Pro Leu 



cgc ccc gag 
Arg Pro Glu 



ggc get gca 
Gly Ala Ala 
15 

cag gcc cgc 
Gin Ala Arg 
30 

ctg cgc gcc 
Leu Arg Ala 
45 

ggc ate aag 
Gly He Lys 



cag gac cag 
Gin Asp Gin 



tgc acc acg 
Cys Thr Thr 
95 

age cag att 
Ser Gin He 
110 

age aac tac 
Ser Asn Tyr 
125 

cag cag gag 
Gin Gin Glu 



age ttg tgg 
Ser Leu Trp 



att ttc ate 
He Phe He 
175 

acc gtg ctg 
Thr Val Leu 
190 

age ttc cag 
Ser Phe Gin 
205 

ggc ate gag 
Gly He Glu 



ggc 4 8 

Gly 



cag 9 6 

Gin 



ate 144 
He 



cag 192 
Gin 



cgc 240 
Arg 
80 

gcc 288 
Ala 



tgg 336 
Trp 



acc 384 
Thr 



aag 432 
Lys 



aac 480 

Asn 

160 

atg 528 
Met 



age 576 
Ser 



acc 624 
Thr 



gag 672 
Glu 
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210 215 220 

gag ggc ggc gag cgc gac cgc gac cgc age acc cgc ctg gtg acc ggc 720 

Glu Gly Gly Glu Arg Asp Arg Asp Arg Ser Thr Arg Leu Val Thr Gly 
225 230 235 240 

ttc ctg ccc ctg ate tgg gac gac ctg cgc age ctg ttc ctg ttc age 768 

Phe Leu Pro Leu lie Trp Asp Asp Leu Arg Ser Leu Phe Leu Phe Ser 
245 250 255 

tac cat cga ttg cgc gac ctg ctg ctg ate gtg gee cgc ate gtg gag 816 

Tyr His Arg Leu Arg Asp Leu Leu Leu lie Val Ala Arg lie Val Glu 

260 265 270 

ctg ctg ggc egg cgc ggc tgg gag ate ctg aag tac tgg tgg aac ctg 864 

Leu Leu Gly Arg Arg Gly Trp Glu lie Leu Lys Tyr Trp Trp Asn Leu 

275 280 285 

etc cag tac tgg age cag gag ctg aag aac tct gca gtg age ctg ctg 912 

Leu Gin Tyr Trp Ser Gin Glu Leu Lys Asn Ser Ala Val Ser Leu Leu 
290 295 300 

aac gee acc gee ate gee gtg gee gag ggc acc gac cgc gtg ate gag 960 

Asn Ala Thr Ala lie Ala Val Ala Glu Gly Thr Asp Arg Val lie Glu 
305 310 315 320 

gtg gtg cag cgc ate tgg cgc ggc ate ctg cac ate ccc acc cga att 1008 

Val Val Gin Arg lie Trp Arg Gly lie Leu His lie Pro Thr Arg lie 
325 330 335 

cgc cag ggc ttc gag cgc gec ctg ctg taa 1038 

Arg Gin Gly Phe Glu Arg Ala Leu Leu * 

340 345 



<210> 76 
<211> 345 
<212> PRT 

<213> Artificial Sequence 
<400> 76 

Ala Val Gly He Gly Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly 

15 10 15 

Ser Thr Met Gly Ala Ala Ser Leu Thr Leu Thr Val Gin Ala Arg Gin 

20 25 30 

Leu Leu Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He 

35 40 45 

Glu Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly lie Lys Gin 

50 55 60 

Leu Gin Ala Arg Val Leu Ala Leu Glu Arg Tyr Leu Gin Asp Gin Arg 
65 70 75 80 

Phe Leu Gly Met Trp Gly Cys Ser Gly Lys Leu He Cys Thr Thr Ala 

85 90 95 

Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Asn Leu Ser Gin He Trp 

100 105 110 

Asp Asn Met Thr Trp Met Glu Trp Glu Arg Glu He Ser Asn Tyr Thr 

115 120 125 

Glu He He Tyr Ser Leu He Glu Glu Ser Gin Asn Gin Gin Glu Lys 
130 135 140 
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Asn 


ulU 


Leu 


Asp 


Leu 


Leu 


Gin 


14 5 










1 RH 

1DU 




Trp 


rfie 


Asn 


lie 


l nr 


Asn 


Trp 










1DJ 






lie 


Veil 


Lii y 


LjI y 


Leu 


lie 


biy 








i ft n 

1 o u 








lie 


v ct 1 


As n 


Arg 


v a i 


Arg 


Gin 






ij j 










7\ >- rr 

/-\i y 


Lieu. 




\7=» i 
v ai 


Pro 


Arg 


Cjiy 




210 










01c: 
<c 1 o 


\jX LI 


uiy 


bi y 


Pin 

LilU 


Arg 


Asp 


Arg 


77 S 










9 ^ n 






Leu 


D 


Leu 


ire 


Trp 


Asp 
















Tyr 


ill o 


Arg 


Leu 


Arg 


Asp 


Leu 








2 60 








Leu 


L6U 


uiy 


Arg 


Arg 


(jjI y 


Trp 






275 










Leu 


Gin 


Tyr 


Trp 


Ser 


Gin 


Glu 




290 










295 


Asn 


Ala 


Thr 


Ala 


He 


Ala 


Val 


305 










310 




Val 


Val 


Gin 


Arg 


He 


Trp 


Arg 










325 






Arg 


Gin 


Gly 


Phe 


Glu 


Arg 


Ala 



340 



57 



Leu 


Asp 


Lys 


Trp 


Ala 


Ser 


Leu 








155 








Leu 


Trp 


Tyr 


He 


Lys 


He 


Phe 






170 










Leu 


Arg 


He 


Val 


Phe 


Thr 


Val 




185 










190 


Gly 


Tyr 


Ser 


Pro 


Leu 


Ser 


Phe 


200 










205 




Pro 


Asp 


Arg 


Pro 


Glu 


Gly 


He 










zzu 






Asp 


Arg 


Ser 


Thr 


Arg 


Leu 


Val 








235 








Asp 


Leu-Arg 


Ser 


Leu 


Phe 


Leu 






250 










Leu 


Leu 


He 


Val 


Ala 


Arg 


He 




265 












Glu 


He 


Leu 


Lys 


Tyr 


Trp 


Trp 


280 










285 




Leu 


Lys 


Asn 


Ser 


Ala 


Val 


Ser 










300 






Ala 


Glu 


Gly 


Thr 


Asp 


Arg 


Val 








315 








Gly 


He 


Leu 


His 


He 


Pro 


Thr 



330 



Leu Leu 
345 
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Trp Asn 
160 
He Met 
175 

Leu Ser 

Gin Thr 

Glu Glu 

Thr Gly 
240 
Phe Ser 
255 

Val Glu 

Asn Leu 

Leu Leu 

He Glu 
320 
Arg lie 
335 



